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Abstract 

This  paper  begins  by  summarizing  the  micro-evidence  on  credit 
markets  from  a  large  number  of  studies  from  all  over  the  world,  with 
the  goal  of  identifying  a  number  of  stylized  fcicts.  We  argue  that, 
in  particular,  the  evidence  strongly  suggests  that  for  poor  people  in 
developing  countries,  imperfections  in  the  credit  market  are  quantita- 
tively very  important.  We  then  build  a  simple  model  that  explains  the 
observed  patterns,  based  on  the  idea  that  monitoring  and  screening 
borrowers  have  both  fixed  and  variable  costs.  We  go  on  to  build  a 
simple  dynamic  model  that  allows  us  to  understand  what  the  obser- 
vations about  the  credit  market  imply  for  the  evolution  of  the  wealth 
distribution. 

JEL  Codes:  012,  D82,  D31 

Kejrwords:  Credit  Markets;  Distribution;  Growth 


1      Introduction 

Development  economists  are,  perhaps  by  necessity,  optimistic  people.  One 
does  not  become  a  development  economist  if  one  believes  that  the  world's 
poorest  are  doing  as  well  as  they  possibly  could.  Indeed  the  premise  of  the 
entire  field'  is  that  there  is  talent  in  every  people,  if  not  every  person,  and  if 

'This  paper  grew  out  a  lecture  I  gave  at  the  World  Congress  of  the  Econometric  Society 
in  August  2000.  I  am  grateful  to  Philippe  Aghion  and  Esther  Duflo  for  their  comments 
and  Marko  Tervio  for  his  help  with  simulations. 

And  of  growth  theory:  What  is  convergence  other  than  the  hope  that  there  is  talent 
in  every  nation? 


there  is  one  central  question  it  has  to  be  what  prevents  people  from  making 
the  best  use  of  their  natural  talents? 

There  are  at  least  five  distinct  answers  to  this  question.  The  first,  which 
is  elaborated  here,  is  the  answer  from  contract  theory:  Talent  is  not  an 
apple,^  one  cannot  simply  go  to  the  market,  sell  one's  talent  and  expect  to 
be  paid  the  appropriate  price.  The  second  is  coordination  failure:  Talent  is 
only  talent  if  it  gets  to  work  with  the  appropriate  other  inputs.  Even  Lennon 
needed  Paul  and  George — had  they  decided  to  go  to  the  City  instead,  he 
too  might  have  found  himself  a  different  profession.^  The  third  is  poHtical 
economy:  Governments  can  and  often  do  make  it  harder  for  people  to  do 
what  they  are  best  at  doing.''  The  fourth  is  learning:  People  may  not  know 
what  they  ought  to  be  doing,  and  even  when  they  do  the  rest  of  the  world 
may  not  appreciate  them.  For  example,  a  growing  body  of  evidence  shows 
that  farmers  are  often  ignorant  or  suspicious  of  more  rewarding  crops  and 
better  seeds. ^  The  final  answer  comes  from  what  has  come  to  be  called 
behavioral  economics:  People  may  not  always  seek  out  the  best  options 
because  they  are  held  back  by  psychological  constraints  or  social  norms.^ 

The  fact  that  this  survey  concentrates  on  the  contract  theoretic  argument 
should  not  be  interpreted  as  evidence  for  its  primacy.  But  it  is  the  argument 
that  has  received  the  most  elaboration  over  the  last  decade  or  so,  and  the 
one  that  best  matches  the  competencies  of  the  present  author.  It  is  therefore 
the  appropriate  topic  for  a  survey  like  this. 

^A  contract  theorist  would  probably  say  an  apple  is  not  an  apple  either — they  can  be 
stale  or  fresh,  sweet  or  sour.  If  you  had  a  lot  of  apples  to  sell,  you  would  probably  want 
to  invest  in  a  reputation  for  selling  only  fresh  and  sweet  apples. 

'  There  is  a  long  tradition  in  development  economics  of  models  that  emphasize  coordi- 
nation failures  going  back  at  least  to  Paul  Rosenstein-Rodan  (1943)  and  Ragnar  Nurkse 
(1953).  See  Murphy,  Shleifer  and  Vishny  (1989)  for  a  model  where  there  is  failure  of 
coordination  between  producers  of  different  goods,  and  Kremer  (1993)  for  a  model  for  a 
model  of  matching  between  different  types  of  talents. 

■"There  is,  of  course,  a  long  tradition  here,  going  back  to  Adam  Smith.  Krueger  (1974) 
and  Bhagwati  (1982),  among  others,  study  the  distortions  in  the  allocation  of  talent  and 
resources  that  come  from  government  policies. 

*See  Besley  and  Case  (1994)  and  Munshi  (2000).  Banerjee  (1992)  provides  theoretical 
arguments  for  why  such  behavior  may  be  rational  for  individual  farmers. 

There  is  a  long  and  controversial  literature  on  this  point.  The  famous  Lewis  model 
(Lewis,  1954)  argued  that  family  norms  could  discourage  people  from  seeking  outside 
options.  The  rational  peasant  model  (Schultz,  1964)  was  articulated  as  a  critique  of 
models  like  the  Lewis  model  (see  also  Cole,  Mailath  and  Postlewaite,  1992,  and  Banerjee 
and  Newman,  1998,  for  two  very  different  attempts  to  reconcile  these  views).  It  is,  however, 
time  to  revisit  this  issue:  With  the  increasing  sophistication  of  the  psychological  models 
used  in  economics,  it  is  now  possible  to  re-ask  the  question  of  whether,  for  example, 
poverty  can  have  direct  discouragement  effects. 


2  Contract  Theory  in  Development  Economics 

Contract  theoretic  arguments  in  development  economics  go  back  at  least 
to  the  work  of  D.  Gale  Johnson  in  the  1940's  and  1950's  in  the  context 
of  land  markets/  Stiglitz's  1974  paper  on  sharecropping,  among  others, 
started  a  tradition  of  formal  contract  theoretic  models  that  seek  to  explain 
why  landlords  and  tenants  often  settle  into  arrangements  that  are,  at  least 
apparently,  less  than  first  best  efficient.^ 

Since  then,  similar  principles  have  been  applied  to  the  study  of  all  the 
other  important  markets:  capital,  insurance  and  human  capital.  The  result 
is  an  enormous  literature  which  I  could  not  even  begin  to  do  justice  to  within 
the  hmits  of  this  survey.  I  will  confine  myself,  therefore^  to  elaborating  on 
a  single  example  from  the  market  for  capital,  which  I  hope  will  allow  me  to 
draw  out  the  most  important  themes,  though  at  several  points  in  the  text 
I  will  point  out  the  connections  with  what  is  understood  about  the  other 
asset  markets. 

3  The  Credit  Market 

The  facts  about  the  credit  market  are  remarkably  stark.  While  neoclassical 
theory  predicts  a  single  price  of  capital  at  which  people  both  borrow  and 
lend,  at  any  point  over  the  last  twenty  years  one  could  point  to  a  set  of 
peoples  in  world  (most  recently  the  Japanese)  who  were  earning  a  negative 
return  on  their  savings,  while  another  set  of  people  were  borrowing  at  real 
rates  of  60%  or  more. 

Indeed,  more  often  than  not,  very  large  differences  between  borrowing 
and  lending  rates  can  be  found  within  a  single  sub-economy.  Banerjee  (2001) 
reviews  a  number  of  empirical  studies  of  individual  credit  markets  in  devel- 
oping countries  and  lists  six  salient  features:^ 

1.  Sizeable  gap  between  lending  rates  and  deposit  rates  within 
the  same  sub-economy: 


'See  Johnson  (1950). 

*See  Stigltiz  (1974)  and  Cheung  (1968). 

"The  review  focuses  on  the  informal  sector  because  the  formal  banking  sector  in  most 
developing  countries  has  tended  to  be  quite  rigid  (interest  rate  caps,  strict  rules  about 
collateral,  inflexible  credit  hmits,  etc.  [see  Ghate,  1992])  with  the  result  that  the  informal 
sector  became  the  supplier  of  the  marginal  units  of  capital  for  all  but  the  very  largest 
firms. 


Ghatak  (1976)  reports  data  on  interest  rates  paid  by  cultivators  in 
India  from  the  All  India  Rural  Credit  Survey  for  the  1951-2  to 
1961-2  period:  The  average  rate  varies  between  a  maximum  of 
18%  (in  1959-60)  and  a  minimum  of  about  15%  (in  1961-62). 
These  numbers  are,  however,  slightly  misleading:  around  25%  of 
the  borrowing  reported  in  these  surveys  were  zero-interest  loans, 
usually  from  family  members  or  friends.  These  should  be  seen 
as  gifts/insurance  rather  than  loans.  If  these  were  left  out,  the 
average  rates  in  these  surveys  would  be  above  20%.  We  are  not 
told  what  the  comparable  rates  for  depositors  were  in  this  period, 
but  Ghatak  reports  that  the  bond  rate  in  this  period  was  around 
3%  and  the  bank  deposit  rate  was  probably  about  the  same. 

Timberg  and  Aiyar  (1984)  report  data  on  indigenous  style  bankers 
in  India,  based  on  surveys  that  they  carried  out.  They  report 
the  gap  between  the  average  rate  charged  to  borrowers  and  the 
average  rate  to  depositors  by  Finance  Companies  was  16.5%. 
The  same  gap  for  financiers  from  the  Shikarpuri  community  was 
16.5%,  12%  for  financiers  from  the  Gujerati  community,  15.5% 
for  the  Chettiars,  11.5%  for  the  Rastogis,  etc. 

The  "Summary  Report  on  Informal  Credit  Markets  in  India"  (Das- 
gupta,  1989)  reports  results  from  a  number  of  case  studies  that 
were  commissioned  by  the  Asian  Development  Bank  and  carried 
out  under  the  aegis  of  the  National  Institute  of  Pubhc  Finance 
and  Policy.  For  the  rural  sector,  the  data  is  based  on  surveys 
of  6  villages  in  Kerala  and  Tamil  Nadu,  carried  by  the  Centre 
for  Development  Studies.  The  average  interest  rate  charged  by 
professional  money-lenders  (who  provide  45.61%  of  the  credit)  in 
these  surveys  is  about  52%.  While  the  average  deposit  rate  is  not 
reported,  the  maximum  from  aU  the  case  studies  is  24%  and  the 
maximum  in  four  out  of  the  eight  case  studies  is  no  more  than 
14%.  For  the  urban  sector,  the  data  is  based  on  various  case 
surveys  of  specific  classes  of  informal  lenders:  For  Finance  Cor- 
porations they  report  that  the  maximum  deposit  rate  for  loans  of 
less  than  a  year  is  12%  while  the  minimum  lending  rate  is  48%. 
For  hire-purchase  companies  in  Delhi,  the  deposit  rate  was  14% 
and  the  lending  rate  was  at  least  28%.  For  auto-financiers  in  Na- 
makkal,  the  gap  between  the  deposit  rate  and  the  lending  rate 
was  19%.^*^    For  handloom  financiers  in  Bangalore  and  Karur, 


This  number  and  all  other  information  about  this  gap  is  measured  in  percentage 


the  gap  between  the  deposit  rate  and  the  lowest  lending  rate  was 
26%." 

Aleem  (1990)  reports  data  from  a  study  of  professional  moneylenders 
that  he  carried  out  in  a  semi-urban  setting  in  Pakistan  in  1980- 
1981.  The  average  interest  rate  charged  by  these  lenders  is  78.5%. 
The  bank  rate  in  that  year  in  Pakistan  was  10%.  However,  it  is 
possible  that  depositors  in  this  area  may  not  have  been  depositing 
in  the  banks,  so  an  alternative  measure  of  the  gap  can  be  obtained 
by  using  the  Aleem's  numbers  for  the  opportunity  cost  of  capital 
to  these  money-lenders,  which  is  32.5%.^" 

2.  Extreme  variability  in  the  interest  rate  charged  by  lenders  for 
superficially  similar  loan  transactions  within  the  same  econ- 
omy: 

Timberg  and  Aiyar  (1984)  report  that  the  rates  for  Shikarpuri  fi- 
nanciers varied  between  21%  and  37%  on  loans  to  members  of  lo- 
cal Shikarpuri  associations  and  between  21%  and  120%  on  loans 
to  non-members  (25%  of  the  loans  were  to  non-members  and  an- 
other 50%  were  loans  through  brokers).  On  the  other  hand,  the 
Gujerati  bankers  charged  rates  of  no  more  than  18%.  Moreover, 
the  rates  faced  by  established  commodity  traders  in  the  Calcutta 
and  Bombay  markets  were  never  above  18%  and  could  be  as  low 
as  9%. 

The  "Summary  Report  on  Informal  Credit  Markets  in  India"  (Das- 
gupta,  1989)  reports  that  Finance  Corporations  offer  advances 
for  a  year  or  less  at  rates  between  48%  per  year  and  the  utterly 
astronomical  rate  of  5%  per  day.  The  rates  on  loans  of  more  than 
a  year  varied  between  24%  and  48%.  Hire-purchase  contracts  of- 
fer rates  between  28%  to  41%  per  year.  Handloom  Financiers 
charge  rates  between  44%  and  68%.  Yet  the  Shroffs  of  Western 
India  offer  loans  at  less  than  21%  and  Chit  Fund  members  can 
borrow  at  less  than  25%. 


points. 

"A  number  of  other  lending  instituions  are  also  mentioned  in  this  study.  However, 
the  range  of  both  deposit  rates  and  lending  rates  is  so  wide  in  these  cases  that  the  gap 
between  the  minimum  lending  rate  and  the  maximum  deposit  rate  is  not  very  large.  This 
does  not  rule  out  the  possibility  that  the  gap  between  the  average  borowing  and  lending 
rate  is  quite  substantial  even  in  these  cases. 

'^This,  however,  understates  the  gap,  since  the  money-lenders  themselves  borrow  this 
money  and  the  original  lenders  are  paid  much  less  than  32.5%. 


The  same  report  tells  us  that  among  rural  lenders,  the  average  rate 
for  professional  money-lenders  (who  in  this  sample  give  about 
75%  of  the  commercial  informal  loans)  was  51.86%,  whereas  the 
rates  for  the  agricultural  money-lenders  (farmers  who  also  lend 
money)  who  supply  the  rest  was  29.45%.  Within  the  category  of 
professional  money-lenders,  about  half  the  loans  were  at  rates  of 
60%  or  more  but  another  40%  or  so  had  rates  below  36%. 

The  study  by  Aleem  (1990)  reports  that  the  standard  deviation  of  the 
interest  rate  was  38.14%  compared  to  an  average  lending  rate  of 
78.5%.  In  other  words,  an  interest  rate  of  2%  and  an  interest  rate 
of  150%  are  both  within  two  standard  deviations  of  the  mean. 

Swaminathan  (1991)  reports  on  a  survey  of  two  villages  in  South  India 
that  she  carried  out:  The  average  rate  of  interest  in  one  village 
varied  between  14.8%  for  loans  collateralized  by  immovable  assets 
(land,  etc.)  and  60%  for  loans  backed  by  moveable  assets.  The 
corresponding  rates  in  the  other  village  were  21%  and  70.6  %. 
Even  among  loans  collateralized  by  the  same  asset — gold — the 
average  rate  in  one  village  was  21.8%  but  it  went  up  to  58.8% 
when  the  loans  were  to  landless  laborers. 

Ghate  (1992)  reports  on  a  number  of  case  studies  from  all  over  Asia: 
The  case  study  from  Thailand  found  that  interest  rates  were  2- 
3%  per  month  in  the  Central  Plain  but  5-7%  in  the  north  and 
north-east  (note  that  5  and  7  are  very  different). 

Gill  and  Singh  (1997)  report  on  a  survey  of  6  Punjab  villages  they 
carried  out.  The  mean  interest  rate  for  loans  up  to  Rs  10,000  is 
35.81%  for  landowning  households  in  their  sample,  but  80.57% 
for  landless  laborers. 

Fafchamps'  (2000)  study  of  informal  trade  credit  in  Kenya  and  Zim- 
babwe reports  an  average  monthly  interest  rate  of  2.5%  (corre- 
sponding to  annualized  rate  of  34%)  but  also  notes  that  this  is  the 
rate  for  the  dominant  trading  group  (Indians  in  Kenya,  whites  in 
Zimbabwe).  Blacks  pay  5%  per  month  in  both  places. ^^ 

Irfan  et  al.  (1999),  mentioned  above,  report  that  interest  rates  charged 
by  professional  money-lenders  vary  between  48%  and  120%. 

3.  Low  levels  of  default: 


Fafchamps  notes  that  when  he  controls  for  the  sector  of  the  economy,  etc.,  this  differ- 
ence goes  away  but  that  just  tells  us  that  the  source  of  the  variation  is  sector  rather  than 
race. 


Timberg  and  Aiyar  (1984)  report  that  average  default  losses  for  the 
informal  lenders  they  studied  ranges  between  0.5%  and  1.5%  of 
working  funds. 

The  "Summary  Report  on  Informal  Credit  Markets  in  India"  (Das- 
gupta,  1989)  attempts  to  decompose  the  observed  interest  rates 
into  their  various  components/"*  and  finds  that  the  default  costs 
explain  14  per  cent  (not  14  percentage  points!)  of  the  total  in- 
terest costs  for  the  Shroffs,  around  7%  for  auto-financiers  in  Na- 
makkal  and  handloom  financiers  in  Bangalore  and  Karur,  4%  for 
Finance  Companies,  3%  for  hire-purchase  companies  and  essen- 
tially nothing  for  the  Nidhis. 

The  same  study  reports  that  in  four  case  studies  of  money-lenders  in 
rural  India  they  found  default  rates  explained  about  23%  of  the 
observed  interest  rate. 

The  study  by  Aleem  gives  default  rates  for  each  individual  lender. 
The  median  default  rate  is  between  1.5  and  2%  and  the  maximum 
is  10%. 

Production  and  trade  finance  are  the  main  reasons  given  for 
borrowing,  even  in  cases  where  the  rate  of  interest  is  rela- 
tively high: 

Ghatak  (1976)  concludes  on  the  basis  of  his  study  that  "the  existing 
belief  about  the  unproductive  use  of  loans  by  Indian  cultivators 
...  has  not  been  substantiated." 

Timberg  and  Aiyar  (1984)  report  that  for  Shikarpuri  bankers  (who 
charge  31.5%  on  average,  and  as  much  as  120%  on  occasion),  at 
least  75%  of  the  money  goes  to  finance  trade  and,  to  lesser  extent, 
industry. 

The  "Summary  Report  on  Informal  Credit  Markets  in  India"  (Das- 
gupta,  1989),  reports  that  several  of  the  categories  of  lenders  that 
have  been  already  mentioned,  such  as  hire-purchase  financiers 
(interest  rates  between  28%-41%),  handloom  financiers  (44%- 
68%),  Shroffs  (18%-21%)  and  Finance  Corporations  (24%-48% 
for  longer  term  loans  and  more  than  48%  on  loans  of  less  than 
a  year)  focus  almost  exclusively  on  financing  trade  and  industry, 
and  even  for  Chit  Funds  and  Nidhis,  which  do  finance  consump- 
tion, trade  and  industry  dominate. 


^In  the  tradition  of  Bottomley  (1963). 


Swaminathan  (1991)  reports  that  in  the  two  villages  she  surveys,  the 
share  of  production  loans  in  the  portfolio  of  lenders  is  48.5%  and 
62.8%.  The  higher  share  of  production  loans  is  in  Gokalipuram, 
which  has  the  higher  interest  rates  (above  36%  for  all  except  the 
richest  group  of  borrowers) . 

Ghate  (1992)  also  concludes  that  the  bulk  of  informal  credit  goes  to 
finance  trade  and  production. 

Murshid  (1992)  studies  Dhaner  Upore  loans  in  Bangladesh  (you  get 
some  amount  in  rice  now  and  repay  some  amount  in  rice  later) 
and  argues  that  most  loans  in  his  sample  are  production  loans 
despite  the  fact  that  the  interest  rate  is  40%  for  a  3-5  month  loan 
period. 

Gill  and  Singh  (1997)  report  that  the  bulk  (63.03%)  of  borrowing  from 
the  informal  sector  goes  to  finance  production.  This  proportion  is 
lower  for  the  landless  laborers  but  it  is  an  non-negligible  fraction 
(36%). 

5.  Richer  people  borrow  more  and  pay  lower  rates  of  interest. 

Ghatak  (1976)  correlates  asset  category  with  borrowing/debt  in  the 
All  India  Rural  Credit  Survey  data  and  finds  a  strong  positive 
relationship. 

Timberg  and  Aiyar  (1984)  report  that  some  of  the  Shikarpuri  and 
Rastogi  lenders  set  a  credit  limit  that  is  proportional  to  the  bor- 
rower's net  worth:  Several  lenders  said  that  they  would  lend  no 
more  than  25%  of  the  borrower's  net  worth,  though  another  said 
he  would  lend  up  to  33%. 

The  "Summary  Report  on  Informal  Credit  Markets  in  India"  (Das- 
gupta,  1989)  tells  us  that  in  their  rural  sample,  landless  laborers 
paid  much  higher  rates  (ranging  from  28-125%)  than  cultivators 
(who  paid  between  21  and  40%).  Moreover,  Table  15.9  in  that 
report  clearly  shows  that  the  average  interest  rate  declines  with 
loan  size  (from  a  maximum  of  44%  to  a  minimum  of  24%).  The 
relation  between  asset  category  and  interest  rate  paid  is  less  clear 
in  their  data  but  it  remains  that  the  second  poorest  group  (those 
with  assets  in  the  range  Rs  5,000-10,000)  pays  the  highest  average 
rate  (120%)  and  the  richest  (those  with  more  than  Rs  100,000) 
pay  the  lowest  rate  (24%). 


Swaminathan  (1991)  finds  a  strong  negative  relation  between  the 
value  of  the  borrower's  land  assets  and  the  interest  rate  he  faces: 
The  poorest  (those  with  no  land  assets)  pay  44.9%  in  one  village 
and  45.4%  in  the  other,  while  the  rich  (those  with  land  valued  at 
more  than  Rs  50,000)  pay  16.9%  and  24.2%  in  the  corresponding 
villages. 

Gill  and  Singh  (1997)  show  that  correlation  between  the  wealth  of  the 
borrower  and  loan  size  is  negative  after  controlling  for  the  interest 
rate.  They  also  find  a  positive  relation  between  the  borrower's 
wealth  and  the  loan  he  gets. 

6.  Bigger  loans  are  associated  with  higher  interest  rates. 

Table  15.9  in  the  "Summary  Report  on  Informal  Credit  Markets  in 
India"  (Dasgupta,  1989)  clearly  shows  that  the  average  interest 
rate  declines  with  loan  size  (from  a  maximum  of  44%  to  a  mini- 
mum of  24%). 

Ghate  (1992)  notes  that  the  interest  rate  on  very  small  loans  in 
Bangladesh  tends  to  be  very  high  (Taka  10  per  week  on  a  loan  of 
Taka  500,  or  86%  per  annum). 

Gill  and  Singh  (1997)  show  that  the  correlation  between  loan  size  and 
the  interest  rate  is  negative  even  after  they  control  for  the  wealth 
of  the  borrower. 

3.1      Tciking  Stock:  The  Facts  about  Credit  Markets 

The  fact  that  there  is  a  gap  between  the  lending  rate  and  the  rate  paid  to 
depositors  is  not,  per  se,  surprising.  The  fact  that  intermediation  is  costly 
is,  after  all,  entirely  commonplace.  What  is  striking  is  the  size  of  the  gap. 
It  is  always  more  than  10%  and  usually  more  than  14%,  in  a  world  where 
interest  rates  paid  to  depositors  are  rarely  more  than  20%  and  usually  closer 
to  10%.  In  other  words,  intermediation  costs  seem  to  eat  up  at  least  a  third 
and  often  half  (and  sometimes  much  more  than  half)  of  the  income  that 
could  go  to  depositors. 

However,  this  argument  overstates  the  point  slightly.  The  probabihty 
that  a  moneylender  would  default  on  his  deposit  liabilities  is  substantially 
lower  than  the  probability  that  borrowers  will  default  on  the  loan,  which 
implies  that  the  default  premium  on  loans  should  be  much  greater  than  the 
default  premium  on  deposits.  From  the  evidence  reported  above,  default  is 
relatively  rare  and  default  costs  rarely  raise  the  interest  rate  by  more  than 
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10%.  The  gap  between  loan  rates  and  deposit  rates  would  be  very  large  even 
if  we  were  to  deduct  10%  from  the  loan  rate.'^^^ 

The  fact  that  interest  rates  vary  quite  so  much  is  particularly  striking 
given  the  standard  neoclassical  prediction  that  in  market  equilibrium  the 
marginal  unit  of  capital  in  every  firm  should  earn  the  same  return.  However, 
given  that  people  might  be  rationed  in  the  credit  market,  it  is  theoretically 
possible  that  the  marginal  product  of  capital  is  actually  equal  in  all  its 
uses,  despite  the  enormous  disparities  in  the  interest  rate.  Note  that  the 
incremental  capital/output  ratio  for  the  Indian  economy  is  estimated  to 
be  around  4.3,  implying  an  marginal  return  on  capital  of  24%.  This  is, 
however,  a  gross  measure  and  the  true  return,  net  of  depreciation,  is  clearly 
substantially  lower  (no  more  than  20%).  The  fact  that  interest  rates  above 
35%  are  standard,  and  those  above  75%  are  by  no  means  rare,  suggests  that 
at  least  some  of  the  users  of  capital  must  value  capital  at  substantially  more 
than  20%. 

Could  it  be  that  all  of  the  demand  at  relatively  high  interest  rates  comes 
from  people  who  have  particularly  insistent  consumption  needs  today?  This 
is  certainly  not  the  stated  purpose  of  the  loans,  as  noted  above.  Of  course, 
money  is  fungible  and  one  cannot  rule  out  the  possibility  that  some  of  these 
people  are  either  deluded  or  untruthful.  However,  it  remains  that  when 
a  hajidloom  producer  borrows  at  48%  or  more  to  finance  consumption,  he 
chose  to  do  so  instead  of  taking  the  money  out  of  his  existing  business. 
Therefore,  it  must  be  that  he  would  lose  more  than  48%  on  any  money  that 
comes  out  of  his  business.  This  may  be  in  part  because  in  the  short  run 
his  assets  are  not  liquid,  but  this  could  not  explain  why  he  accepts  ongoing 
financing  on  these  terms.  Therefore,  he  must  be  earning  marginal  returns 
that  are  close  to  48%.^'^ 


One  might  be  worried  that  while  default  rates  are  low  on  average,  default  may  be  very 
important  in  those  cases  where  the  interest  rate  is  high.  However,  this  is  not  a  problem 
since,  for  the  most  part,  we  look  at  interest  rates  and  default  rates  weighted  by  volume 
(or  equivalently,  do  a  Bottomley  [1975)  decomposition).  Moreover,  in  the  one  detailed 
micro-study  we  have  where  the  average  interest  rate  is  very  high  (Aleem,  1990),  default 
rates  are  actually  very  low  (always  less  than  10%,  and  usually  less  than  2%). 

'^  Delay  in  repayment  for  which  no  extra  interest  is  charged  is  another  factor  that  reiises 
the  interest  rate.  In  Aleem's  data,  delay  is  much  more  common  than  default,  but  in  a 
significant  fraction  of  the  cases  the  lender  is  able  to  charge  interest  for  the  extra  days. 
Moreover,  the  percentage  of  loans  that  are  late  never  exceeds  25%  and  the  average  delay 
is  no  more  than  6  months,  so  at  worst  this  would  raise  the  interest  rate  by  a  factor  of  1.12. 

There  is,  once  again,  the  possibihty  that  a  part  of  the  reason  why  these  rates  are  so 
high  is  because  of  default  risk.  In  other  words,  the  expected  return  on  the  marginal  units 
of  capital  need  not  be  as  high  as  48%.  However  as  already  observed,  defaults  contribute 
relatively  little  to  the  level  of  the  interest  rate. 
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The  fact  that  the  marginal  product  of  capital  varies  substantially  across 
borrowers  in  the  same  sub-economy  is  supported  by  more  direct  evidence 
from  the  knitted  garment  industry  based  on  data  that  I  collected  in  joint 
work  with  Kaivan  Munshi  (Banerjee  and  Munshi,  2001).  Tirupur  produces 
70%  of  India's  knitted  garment  exports,  and  India  is  a  major  exporter  of 
knitted  garments.  There  are  two  communities  of  producers  in  Tirupur: 
Gounders,  who  are  linked  by  community  ties  to  a  rich  local  agricultural 
community;  and  Outsiders,  a  motley  crew  of  businessmen  from  all  over 
India.  They  produce  exactly  the  same  goods,  yet  they  use  radically  different 
technologies.  Gounders  invest  much  more  than  Outsiders  at  all  levels  of 
experience,  both  in  absolute  terms  and  relative  to  output.  Average  capital- 
output  ratios  for  Gounders  can  be  three  times  as  large  as  that  for  Outsiders 
and  is  typically  twice  as  large.  However,  all  the  evidence  points  to  the 
Outsiders  being  more  able:  they  enjoy  faster  output  growth  and  their  output 
outstrips  that  of  the  Gounders  after  a  few  years. 

One  possible  situation  where  high  ability  people  may  invest  less  would 
be  if  capital  were  less  useful  for  them,  which  would  be  the  case  if  abihty  and 
capital  were  substitutes  in  the  production  function.  The  evidence,  however, 
points  against  this  explanation:  When  we  compare  Gounders  with  Gounders 
or  Outsiders  with  Outsiders,  it  is  clear  that  those  who  grow  faster  and 
produce  more  also  invest  more.  Therefore,  it  seems  relatively  clear  that  the 
Outsiders  invest  less  despite  having  a  higher  marginal  product  of  capital. 

What  explains  why  the  credit  markets  behave  in  this  way?  Why  is 
intermediation  so  inefficient  with  some  people  and  so  efficient  with  others? 
Why  are  the  rich  borrowers  and  those  who  borrow  more  favored  by  the 
market? 

The  standard  theory  of  interest  rates  decomposes  them  into  default  rates, 
opportunity  cost,  transaction  costs  and  monopoly  rents.  This  is  useful  de- 
scriptively but  stops  well  short  of  an  explanation — the  problem  is  that  none 
of  these  can  be  seen  as  independent  causal  factors.  Take  the  example  of 
default  rates.  The  fact  that  default  rates  are  relatively  low  is  not  a  fact 
about  the  nature  of  default:  Timberg  and  Aiyar  (1984)  observe  that  some 

'*This  is  what  Caballero  and  Hamraour  (2000)  call  scrambling.  The  proximate  reason,  it 
appears,  is  the  Gounders  have  a  lot  of  investible  funds  that  they  cannot  profitably  lend  out 
because  intermediation  is  so  inefficient  in  India.  Instead,  they  set  up  their  own  garment 
firms  or  lend  to  friends  and  family  in  the  garment  business.  Since  these  firms  are  set  up 
as  a  conduit  for  this  surplus  capital,  they  are  not  required  to  be  particularly  productive. 
The  Outsiders,  by  contrast,  come  from  traditional  entrepreneurial  communities  and,  as 
a  result,  their  capital  probably  has  many  alternative  uses.  In  other  words,  they  do  not 
invest  in  Tirupur  because  they  lack  other  choices.  This  makes  them  more  likely  to  be 
productive  but  also  less  willing  to  invest  a  lot. 
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branches  of  the  state-owned  commercial  banks  in  India  have  default  rates 
up  to  60-70%.  The  low  default  rates  observed  in  the  studies  we  mention  are 
a  result  of  the  steps  taken  by  lenders  to  avoid  default. 

Monitoring  the  borrower  is  an  obvious  example  of  the  kind  of  steps  that 
lenders  take.  It  is  also  an  important  source  of  what  goies  under  the  rubric 
of  transaction  costs:  Aleem  (1990)  and  Irfan  et  al.  (1999)  provide  a  list  of 
steps  taken  by  the  lender  to  avoid  default.  These  include  getting  to  know 
the  borrower  through  other  transactions,  visiting  his  establishment,  making 
enquiries  about  him  and  going  after  him  to  make  sure  he  repays. 

Lenders  also  protect  themselves  by  limiting  their  lending  to  borrowers 
they  know.^^  This  has  four  important  consequences.  First,  it  pushes  capital 
towards  well-connected  borrowers  and  away  from  less  well-connected  bor- 
rowers, even  when  there  is  no  difference  in  their  productivity.  Second,  it 
makes  it  important  that  lending  be  local — the  lender  must  know  and  trust 
his  borrowers.  This  adds  one  or  more  layers  of  intermediation  to  the  process 
of  lending,  with  additional  transaction  costs  entering  at  each  of  the  stages, 
which  raises  the  opportunity  cost  of  capital.  Third,  it  forces  the  lender  to 
limit  his  lending,  with  the  consequence  that  both  his  capital  and  his  skills 
as  a  lender  may  remain  unused  for  a  significant  part  of  the  time.  This  raises 
both  the  opportunity  cost  of  the  capital  and  the  transaction  cost  (which 
includes  a  part  of  the  lender's  time).  Finally,  it  gives  the  lender  some  ex 
post  monopoly  power,  as  a  borrower  would  find  it  hard  to  leave  a  lender 
who  knows  him  well.  Under  competitive  conditions,  these  ex  post  rents  will 
be  dissipated  in  ex  ante  competition,  with  lenders  in  effect  subsidizing  new 
borrowers  in  order  to  extract  rents  later  from  those  who  will  become  his 
long-term  clients. 

What  this  tells  us  is  that  the  four  components  of  the  interest  rate  are  all 
jointly  determined  in  the  process  of  the  lender  making  his  lending  decisions. 
Depending  on  the  lender's  strategy,  it  could  be  that  the  transaction  costs 
dominate  or  the  opportunity  cost  dominates  or  that  default  or  monopoly 
rents  become  very  important.  The  strategy  could  be  very  different  depend- 
ing on  the  nature  of  the  clientele  and  other  environmental  characteristics. 
This  may  be  a  part  of  the  reason  why  different  people  have  taken  very  dif- 
ferent views  of  informal  credit  markets:  Aleem,  for  example,  finds  that  for 
every  rupee  lent,  about  half  a  rupee  goes  into  transaction  costs,  while  in 
Dasgupta  (1989)  only  about  30%  of  interest  costs  are  explained  by  trans- 
action costs  (strictly  establishment  costs)  while  Ghate  (1992)  argues  that 

^"See  McMillan  and  Woodruff  (1999). 
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transaction  costs  are  unimportant  except  in  the  case  of  very  small  loans. ^^ 
The  fact  that  all  these  decisions  are  interrelated  clearly  makes  it  danger- 
ous to  use  any  single  one  of  these  components  as  a  measure  of  the  efficiency  of 
intermediation.  For  example,  Ghate  (1992)  sees  the  low  level  of  transaction 
costs  in  his  sample  as  evidence  for  the  remarkable  efficiency  of  informal  lend- 
ing. But,  as  has  already  been  noted,  transaction  costs  may  be  low  because 
the  lenders  are  very  choosy  about  to  whom  they  lend.  This  raises  the  op- 
portunity cost  of  capital  (since  capital  is  often  idle)  and  limits  credit  access, 
both  of  which  have  their  welfare  costs.  Likewise,  the  low  rate  of  defaults  in 
informal  transaction  is  often  mentioned  as  evidence  for  their  efficiency,  but 
this  is  obviously  misleading  if  it  comes  at  the  cost  of  increased  monitoring 
or  reduced  lending.  Finally,  the  presence  of  rents  in  lending  is  not,  per  se, 
evidence  for  lack  of  competition  in  the  market.  As  pointed  out  above,  in 
this  type  of  market,  ex  post  rents  are  consistent  with  ex  ante  competition. 
A  further  implication  of  this  observation  is  that  both  loan  size  and  the 
interest  rate  are  jointly  determined  and  therefore  one  cannot  give  a  causal 
interpretation  of  the  relation  between  interest  rates  and  loan  size  reported 
above.  Rather,  one  should  see  both  of  these  as  outcomes  that  are  deter- 
mined by  more  primitive  variables  such  as  the  wealth  of  the  borrower,  his 
productivity,  the  liquidity  of  his  assets,  etc.  This  also  makes  it  harder  to 
interpret  the  reported  negative  relation  between  the  borrower's  wealth  and 
the  interest  rate.  In  principle,  it  could  be  entirely  a  result  of  the  fact  that 
rich  borrowers  borrow  more. 

Most  importantly,  this  line  of  argument  underscores  the  importance  of 
developing  a  proper  theory  of  credit  markets.  Such  a  theory  would  explain 
the  variation  in  interest  rates  and  the  gap  between  interest  rates  and  deposit 
rates  in  terms  of  the  true  primitives  of  the  model,  and  make  predictions 
about  the  relation  between  loan  size  and  interest  rates  and  borrower  and 
lender  characteristics.  While  there  is  a  long  tradition  of  models  of  imperfect 
credit  markets  going  back  to  JafFee  and  Russel  (1976)  and  Stiglitz  and  Weiss 
(1981),  and  the  arguments  behind  why  credit  markets  can  fail,  based  on 
moral  hazard  and/or  adverse  selection,  are  well-known,  I  feel  that  it  is  useful 
to  develop  a  framework  that  has  a  more  direct  empirical  orientation. 

^"Even  within  the  set  of  case-studies  reported  by  Ghate,  there  seems  to  be  considerable 
variation.  In  Kerala,  the  case-study  concludes  that  transaction  costs  are  of  negligible 
importance  while  the  Thai  study  concludes  that  transaction  costs  added  between  3  andl4 
percentage  points  to  the  interest  cost. 
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3.2     A  Simple  Model  of  Moral  Hazard  in  the  Credit  Market 

There  is  an  investment  opportunity  whose  gross  returns  are  F{K)R{p)  with 
probabihty  p  and  0  otherwise,  where  K  is  the  amount  invested  and  F{-)  is 
a  production  function.  If  an  investor  wants  to  invest  more  than  his  wealth, 
W,  he  will  need  to  borrow.  There  is  a  capital  market  and  the  (gross)  cost  of 
capital  in  that  market  is  p.  To  make  this  problem  interesting  it  is  assumed 
that: 

1 .  p  is  a  choice  for  the  investor  but  is  unobserved  by  the  lender,  p  takes 
a  value  between  po  and  p\ . 

2.  E{p)  =  pR{p)  has  the  property  that  E'{po)  >  0,  and  E"{p)  <  0. 

3.  The  only  possible  contract  is  a  loan  contract.^' 

3.2.1      The  Basic  Moral  Hazard  Problem 

The  optimal  value  of  p,  p*  is  clearly  greater  than  po  and  may  or  may  not 
be  less  than  pi .  The  combination  of  the  rest  of  the  assumptions  tell  us  that 
there  is  no  guarantee  that  p*  would  be  chosen  in  equilibrium.  To  see  this, 
note  that  the  borrower,  who  is  assumed  to  be  risk- neutral,  will  choose  p  to 
maximize  F{K)E{p)  —  pr{K  —  W)^  where  r  is  the  interest  rate  that  has  to 
be  paid  to  the  lender  to  make  him  wiUing  to  lend. 

The  borrower  will  choose  p  such  that  E'[p)F{K)  —  r{K  —  W)  =  0.^^ 
This  is  quite  obviously  inconsistent  with  the  social  optimum:  the  borrower 
clearly  wants  to  choose  p  <  p*  ■  This  is  the  standard  incentive  problem  in 
credit  markets:  Society  cares  about  net  output  but  the  borrower  only  cares 
about  what  remains  after  paying  interest.  This  is  the  essence  of  all  models 
of  ex  ante  moral  hazard  in  the  credit  market. 

Next,  notice  that  the  first  order  condition  for  the  borrower's  choice  of  p 
can  be  rewritten  in  the  form: 

^W^=r(l-f).  (I) 

Prom  this  equation  it  is  evident  that  p  depends  on  three  things:  The 
average  product  of  capital,  F{K)/K;  the  leverage  ratio  K/W;  and  the  in- 
terest rate,  r.  If  capital  is  more  productive,  the  borrower  is  less  inclined  to 

'This  rules  out  making  the  borrower's  payments  depend  on  the  project's  reahzed  re- 
turns. Diamond  (1989)  justifies  this  assumption  by  assuming  that  the  recJized  return  is 
not  pubhcly  observable  except  by  making  use  of  a  liquidation  proceeding,  which  is  costly 
to  the  point  of  using  up  all  available  output.  This  makes  sure  that  a  borrower  will  not 
willfully  default  as  long  as  the  lender  threatens  to  go  into  Uquidation  whenever  he  defaults. 
"Assuming  an  interior  optimum  exists. 
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misbehave,  and  this  is  reflected  in  a  lower  p.  Being  more  leveraged  worsens 
his  incentives  and  so  does  a  higher  interest  rate,  which  is  consistent  with  the 
observation  made  above  that  the  interest  cost  burden  is  the  source  of  the 
distortion. 

Property  1.  Efficiency:  There  is  less  inefficiency  in  the 
credit  relationship  when  there  is  less  leveraging,  when 
the  interest  rate  is  lower,  and  when  the  project  is  more 
productive. 

From  this  it  follows  that  the  equilibrium  value  of  p  can  be  written  in  the 
form: 

p  =  piR,F{K)/K)-, 

where  i?  =  r(l  —  ^)  is  the  interest  cost  per  unit  of  investment.  Clearly 
^  <  0  and  Qpix)IK  ^  ^-  Writing  the  relation  in  this  form  draws  attention 
to  the  important  role  played  by  the  shape  of  the  production  function.  When 
F{-)  is  concave,  F{K)/K  decreases  as  a  function  of  K.  Therefore,  those  who 
invest  more  will  be  more  liable  to  moral  hazard,  even  after  controlling  for 
the  leverage  ratio.  However,  if  F  is  convex,  at  least  over  a  range,  increasing 
the  level  of  investment  may  increase  profitability  and  improve  the  borrower's 
incentives.  As  we  will  see,  this  distinction  may  be  very  important  for  some 
questions. 

3.2.2      The  Interest  Rate 

We  have  so  far  treated  r  as  a  parameter.  In  fact,  if  there  is  competition  in 
lending,  lenders  should  not  make  any  profits,  which  would  imply  that 

r  =  p/p,  (2) 

or 

R  =  p{l-W/K)/p=-, 
P 

where  F  is  the  cost  of  capital  per  unit  of  investment. ^^  Solving^  =  p{R,  F{K)/K) 
along  with  R  =  T/p,  gives  us  p  =  p{T,F{K)/K)  and  R  =  R{T,F{K)/K). 

^^The  assumption  of  perfect  competition  in  the  credit  market  is  not  uncontroversial. 
There  is  a  long  tradition  of  papers  that  view  high  interest  rates  as  evidence  for  monopoly 
power  in  the  credit  market.  However, as  already  pointed  out,  the  issue  of  rents  in  the 
credit  market  is  likely  to  be  quite  delicate,  since  competition  operates  ex  ante  rather  than 
ex  post.  Therefore,  the  absence  of  ex  ante  rents  is  consistent  with  Bhaduri's  (1977)  model 
of  how  lenders  trap  borrowers  into  a  permanent  cycle  of  debt  and  debt  repayment.   The 
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However,  it  is  easy  to  construct  examples  where  these  equations  have  mul- 
tiple solutions:  Intuitively,  a  fall  in  p  raises  r,  but  a  rise  in  r,  as  we  already 
saw,  puts  downward  pressure  on  p.  It  is  not  clear,  however,  that  we  can 
interpreted  these  as  multiple  equihbria — if  the  lender  knows  the  rules  of  the 
game,  he  knows  that  he  can  pick  the  best  equilibrium  and  make  everyone 
better  off,  simply  by  setting  the  right  interest  rate.  Therefore,  unless  the 
lender  is  boundedly  rational,  we  should  probably  assume  that  the  best  equi- 
librium is  always  chosen.  This  is  the  equilibrium  with  the  lowest  interest 
rate. 

Assuming  that  this  is  the  equilibrium,  the  comparative  statics  of  the  p(-) 
function  are  inherited  by  the  p(-)  function,  and  r  =  &  shares  the  properties 
of  the  p  function,  only  reversed.  A  lower  leverage  ratio  increases  p  and  lowers 
the  interest  rate,  as  does  a  higher  average  product  of  capital.  Lowering  the 
cost  of  capital  lowers  the  rate  of  interest  more  than  proportionately  since 
the  repayment  rate  goes  up. 

Property  2.  Interest  Rates:  Borrow^ers  who  are  more 
leveraged  tend  to  pay  higher  rates,  while  more  produc- 
tive borrowers  pay  lower  rates.  Raising  the  cost  of  cap>- 
ital  raises  the  interest  rate  more  thEin  proportionately. 

3.2.3      The  Level  of  Investment 

The  next  step  is  to  endogenize  the  level  of  borrowing.  The  borrower's  choice 
of  K  maximizes 

F{K)E{p)  -  p{K  -  W) 

under  the  assumption  that  p  depends  on  K  through  the  p{-)  function.  The 
first  order  condition  for  that  maximization  is: 

If  we  compare  this  with  the  first  order  condition  in  a  first  best  world, 
F'{K)E{p*)  =  p,  we  see  that  there  are  three  sources  of  distortion.  First, 
E{p)  <  E{p*),  which  says  that  capital  is  less  productive  and  therefore  the 
borrower  wants  to  invest  less.  Second,  ^  is  negative  which  also  discourages 

evidence  seems  to  support  the  hypothesis  of  ex  ante  competition:  The  few  studies  (Ghate 
(1992),  Dasgupta  (1989),  Aleem  (1989)  that  compute  the  gap  between  the  interest  rate 
charged  and  the  various  costs  of  lending  (opportunity  cost,  monitoring  costs,  default  costs) 
do  not  find  a  large  gap  on  average,  though  one  cannot  reject  the  possibihty  that  there  is 
a  large  rent  component  in  many  individual  transactions. 
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investment.  Finally,  there  is  the  second  term  on  the  left-hand-side,  which  can 
be  positive  or  negative  depending  on  the  sign  of  — qk~-  This,  as  we  have 
already  observed,  depends  on  whether  the  production  function  is  concave 
or  not.  If  it  is  concave,  the  second  term  is  negative  and  it  is  unambiguously 
true  that  imperfections  in  the  capital  market  lead  to  less  investment.  If  not, 
the  second  term  may  be  positive  and  if  this  effect  is  large  enough,  it  could 
outweigh  the  first  effect  and  generate  over-investment.  Whether  this  possi- 
bility is  actually  worth  taking  seriously  remains  an  open  question,  awaiting 
more  precise  calibrations  of  the  model.^'* 

Another  important  property  of  the  first  best  is  that  the  amount  invested 
is  independent  of  the  wealth  of  the  investor.  In  our  present  model,  if  we  were 
to  increase  W,  keeping  K  fixed,  we  know  from  Property  1  that  p  would  go 
up,  raising  E{p)  and  reducing  E'{p).  As  long  as  F  is  concave,  both  of  these 
effects  go  in  the  same  direction:  They  both  raise  the  rewards  for  investing 
more,  and  therefore  there  is  more  investment. ^^  In  fact,  in  the  special  case 
where  F{K)  =  aK,  i.e.,  a  linear  production  technology,  K  not  only  goes  up 
when  W  goes  up,  it  is  precisely  proportional  to  W. 

The  general  case  of  a  non-concave  F  tends  to  be  complex.  One  inter- 
esting example,  where  there  is  a  single  indivisible  investment,  turns  out  to 
be  very  straightforward.  In  this  case  people  either  invest  or  do  not,  and 
since  those  who  have  more  wealth  choose  a  higher  p  at  the  same  level  of 
investment  (Property  1),  they  are  the  ones  who  will  invest.  More  generally, 
non-convex  production  technologies  raise  the  possibility  that  the  poor  vdll 
actually  invest  more  than  the  rich:  Intuitively,  if  the  production  function  is 
convex,  increasing  investment  raises  productivity,  which  improves  incentives 
through  its  direct  effect.  However,  there  is  also  an  indirect  effect:  Investing 
more  makes  the  borrower  more  leveraged  and  this  worsens  incentives.  The 
balance  of  these  two  effects  may  be  different  for  the  rich  and  the  poor,  since 
their  incentive  problems  are  different,  and  in  principle  it  could  be  that  the 
poor  end  up  investing  more.  However,  it  seems  unlikely  that  these  effects 
would  dominate  the  main  effect  of  being  richer,  which  is  that  (at  the  same 
level  of  investment)  richer  people  are  less  leveraged  and  therefore  have  better 
incentives  and  as  a  result  their  capital  is  more  productive. 

Lowering  the  cost  of  capital  in  this  model  increases  p  and  this,  as  argued 
above,  encourages  investment.  However,  lowering  the  cost  of  capital  also 
increases  the  amount  invested  in  the  first  best,  so  that  there  is  no  clear 

^''Lehnert,  Ligon  and  Townsend  (1999)  argue  that  this  is  a  real  possibihty. 
^^ Actually,    there   is   a   third   effect:     Increasing    W/K,    it   can   be   shown,    reduces 
dp/dF(K)/K,  thereby  reinforcing  the  effect  of  the  fall  in  E'{p). 
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prediction  for  the  extent  of  underinvestment. 

Property  3.  The  Level  of  Investment:  Capital  market 
imperfections  lead  to  under-investment  in  the  typical 
case,  though  it  is  not  inconceivable  that  they  could  gen- 
erate over-investment.  The  more  wealthy  will  tend  to 
invest  more  in  absolute  terms.  When  the  production 
technology  is  linear,  the  amount  invested  will  be  propor- 
tional to  the  investor's  wealth.  When  there  is  a  single 
indivisible  investment,  the  rich  are  more  likely  to  invest 
than  the  poor.  Lowering  the  cost  of  capital  increases 
investment. 

Capital  market  imperfections  reduce  the  demand  for  capital.  For  a  given 
supply  curve  of  capital,  this  means  that  the  cost  of  capital  will  be  lower  than 
it  would  be  otherwise.  In  the  longer  run,  however,  the  supply  of  capital 
will  also  respond  to  the  pattern  of  wealth  creation  generated  by  the  capital 
market  imperfection  and  the  net  impact  on  the  cost  of  capital  is  ambiguous. 

Property  4:  The  Cost  of  Capital:  For  a  given  supply 
curve  for  capital,  imperfect  capital  markets  will  have  a 
lower  cost  of  capital,  but  this  is  no  longer  necessarily 
true  once  we  take  into  account  the  impact  of  the  capital 
market  imperfection  on  the  supply  of  credit. 

3.2.4     Introducing  Monitoring 

The  model  developed  so  far  is  useful  in  developing  intuition  about  how  the 
credit  market  works  but  it  has  an  important  limitation  in  terms  of  explaining 
the  data.  As  we  have  already  seen,  the  repayment  rates  in  most  informal 
credit  transactions  are  very  high  (over  90%).  It  follows  from  equation  2  that 
the  interest  charged  by  a  competitive  lender  can  only  be  about  10%  higher 
than  the  cost  of  capital,  which  from  all  the  evidence  above  is  much  too  small 
a  margin. 

The  missing  piece  of  the  story  is  monitoring.  We  have  assumed  so  far 
that  the  lender  cannot  do  anything  to  affect  the  borrower's  choice  of  p.  This 
is  clearly  an  extreme  assumption,  since,  as  already  mentioned,  lenders  can 
and  do  monitor  borrowers. 

The  point  of  all  these  activities  is  to  learn  more  about  the  borrower. 
This  helps  in  two  ways:  First,  by  allowing  the  lenders  to  pick  borrowers  for 
whom  the  interval  [po,  Pi]  is  relatively  small,  thereby  hmiting  the  possibility 
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of  moral  hazard.  And  second  by  getting  to  know  the  borrower's  environment, 
thereby  making  it  easier  to  find  out  when  the  borrower  is  not  doing  what 
he  has  promised  to  do  with  the  money. 

In  addition  to  this  kind  of  ex  ante  monitoring  there  is  ex  post  monitoring 
of  the  project,  checking  that  the  borrower  has  done  what  he  had  promised 
to  do  with  the  money.  For  example,  the  lender  can  try  to  make  sure  that 
the  borrower  is  spending  the  money  on  inputs  for  his  project  rather  on 
consumption.  Finally  there  is  collection:  Once  the  loan  falls  due,  the  lender 
has  to  spend  time  chasing  each  overdue  loan. 

It  is  not  possible  to  capture  all  of  these  different  aspects  of  monitoring 
in  a  single  model,  so  we  limit  ourselves  to  one  specific  model,  though  some 
of  the  other  models  will  be  discussed  in  a  later  section.  We  introduce  mon- 
itoring into  the  model  by  assuming  that  if  the  lender  monitors  at  a  level  a, 
the  borrower  will  chose  a  project  p(o)  or  a  project  with  a  p  no  lower  than 
p(a).^^  We  assume  that  this  comes  about  either  through  ex  ante  monitor- 
ing of  the  project  (screening  of  projects  before  the  loan  is  given)  or  ex  post 
monitoring  of  the  project  (checking  on  the  borrower  after  he  has  been  given 
the  loan  and  punishing  him  if  he  has  not  done  what  he  was  supposed  to  do). 
The  problem  is  that  we  know  very  little  about  the  nature  of  the  empirical 
relation  between  monitoring  and  project  choice.  The  only  option  we  have  is 
to  reason  on  purely  a  priori  grounds. 

One  assumption  that  has  a  certain  plausibility  is  that  the  amount  of  mon- 
itoring necessary  is  a  function  of  the  extent  of  misahgnment  of  incentives 
between  the  borrower  and  the  lender.  The  borrower  in  our  model  wants  to 
choosep  =  p{R,F{K)/K),  which  gives  himapayoSoi  F(K)E{p{R,F{K)/K))- 
p{K /W,r, F{K) / K)r{K  —  W) ,  while  the  lender  wants  him  to  choosep  which 
gives  him  a  payoff  of  F{K)E{p)  —  pr{K  —  W).  The  extent  of  misahgnment 
is  therefore 

D  =  F{K)[E{p{R,  F{K)/K))  -  Eip)]  -  \p{R,  F{K)/K)  -  p]RK.       (4) 

Our  assumption  is  then  that  the  amount  of  monitoring  is  a  function  of  D. 
However,  in  order  to  allow  for  different  types  of  scale  effects,  we  write  it  in 
a  slightly  more  general  form: 

M  =  M(A',D//\",m), 

where  m  is  a  parameter  that  shifts  the  monitoring  cost  function  (^  >  0). 
"*The  borrower  will  typically  choose  the  lowest  permissible  value. 
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3.2.5      The  Cost  of  Capital  with  Monitoring 

The  lender's  participation  constraint  (2)  now  takes  the  form: 

^^T_^MiK,D/K,m)_  ^^^ 

p  Kp 

This  equation  defines  R{r,K,p,m),  the  interest  rate  for  a  borrower  with 
a  fixed  W  who  wants  to  invest  an  amount  K  and  promises  to  choose  a 
project  p.  Using  this,  we  can  define  the  expected  cost  of  credit  per  unit  of 
investment:  C{r,K,p,m)  =  pR. 

This  formulation  of  the  supply  side  of  credit  has  the  obvious  advantage 
that  the  interest  rate  can  be  much  higher  than  the  cost  of  capital  even  if 
defaults  are  rare.  This  is  because  monitoring  costs  can  be  very  high — indeed 
the  reason  why  there  is  very  little  default  may  be  a  result  of  the  resources 
spent  on  monitoring. 

It  is  useful  to  begin  our  analysis  of  this  model  with  an  examination  of 
the  properties  of  the  C(-)  function.  Simple  differentiation  tells  us: 

dR     _     1 

af      ~  p-p{R,F{K)/K)      dM        ^"""^ 

^  K  d{D/K) 


dR  dM/d' 


rn 


dm  A'[jD        P-P(R,F(K)/K)      dM 


K  d{D/K)i 

Since  [p  —  ^""^'  '  ^' — ^ —  afo/K)!  ^  P^  ^^^^  *®^^  ^^  *^^^  increases  in  the  cost 
of  lending  (represented  by  a  rise  in  p  or  in  rn)  have  a  multiplier  eflFect, 
resulting  in  a  bigger  increase  in  the  interest  rate  than  would  be  warranted 
by  the  direct  effect  of  the  increase  in  cost.^'  This  is  because  the  initial  rise 
in  the  interest  rate  worsens  the  borrower's  incentives  and  makes  it  necessary 
that  he  be  monitored  more,  which  raises  the  cost  of  lending  even  further, 
etc.  This  property  is  obviously  also  inherited  by  the  C  {■)  function. 

Property  5.  Multiplier:  The  interest  rate  and  the  amount  of 
monitoring  can  be  very  sensitive  to  changes  in  the  cost  of  capital 
and/or  the  cost  of  monitoring. 

This  is  an  important  property:  It  tells  us  that  there  are  even  relatively 
small  differences  in  the  monitoring  cost,  or  the  cost  of  capitcd  can  induce  a 
lot  of  variation  in  the  interest  rate,  which  helps  to  explain  why  we  observe 
so  much  variation. 


^''in  principle,  this  increase  can  be  very  large  since  [1  -  P  ''^^'^^''^^^^^  g-p^^.J  can  be 
very  close  to  0  or  even  negative  (in  which  case,  the  equilibrium  interest  changes  discon- 
tinuously). 
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A  related  and  important  property  of  the  C  {■)  function  comes  from  dif- 
ferentiating equation  5  with  respect  to  p.  This,  after  some  algebraic  manip- 
ulations, gives  us: 

dC  _    1       dM     p{R,F{K)/K)RK  -pF{K)E'{p) 
~dp  ~  K^d(D/K)  ip-p{R,F[K)/K))     dM  • 

^      '       ^  -f  K  d(D/K) 

Equation  1  tells  us  that  E'{p{R,F{K)/K))F{K)  =  RK.  Using  this  (assum- 
ing that  p-  J^-qMMj^{p-p{R,F{K)IK))  >  0),  it  is  immediately  clear  that 

the  sign  of  ^  depends  on  the  sign  oi  p{R,F{K)/K)E'{p{R,F{K)/K))  - 
pE'{p).  Since  p  >  p{R,F{K)/K)  it  follows  that  Cp  can  only  be  positive  if 
the  function  pE'{p)  is  a  decreasing  function  of  p  over  a  range. 

This  makes  it  clear  that  it  is  entirely  possible  that  Cp  be  negative  for  all 
p,  implying  that  implementing  high  values  of  p  may,  paradoxically,  require 
less  monitoring  than  implementing  lower  values.  This  is  because  a  high  p 
generates  a  low  R  and  this  improves  incentives. 

In  such  situations  it  may  be  optimal  to  raise  p  all  the  way  to  its  maxi- 
mum, i.e.  to  pi-  In  particular,  this  will  be  true  as  long  as  E{p)  is  everywhere 
increasing  in  p  over  its  admissible  range^^  and  it  will  remain  true  irrespective 
of  how  costly  it  is  to  monitor}'^  However,  it  is  easy  to  see  that  it  will  never 
be  optimal  for  p  to  exceed  its  social  welfare  maximizing  level,  i.e  the  value 
of  p  for  which  E'{p)  =  0.  This  is  because  when  E'{p)  —  0,  Cp  is  clearly 
positive. 

Property  6.  Default:  Very  low  levels  of  default  may  be  optimal 
even  when  monitoring  is  quite  costly,  though  it  is  never  optimal 
to  have  less  default  than  in  the  social  optimum. 

This  is  important  because  it  tells  us  that  it  is  often  optimal  to  aim  for 
very  low  default  rates  even  at  the  cost  of  lots  of  costly  monitoring  and  high 
interest  rates.  This  is  reassuring,  given  that  the  combination  of  very  low 
default  rates  and  very  high  interest  rates  is  by  no  means  uncommon.'''' 

3.2.6      The  Optimal  Credit  Contract  with  Monitoring 

The  optimal  credit  contract  will  be  a  combination  {K,p)  that  maximizes 

F{K)E{p)  -  pR{T,  K,  p,  m)KM. 
^*For  example,  Cp  is  negative  whenever  E(p)  takes  the  form  Ap^ ,  with  ^  >  0  and 

/3e(o,i). 

^'Of  course,  this  is  conditional  on  the  loan  contract  being  viable,  which  is  not  the  case 
when  monitoring  is  too  costly. 

'"Aleem's  data  set  from  Pakistan,  mentioned  above,  is  an  example. 
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The  first  order  conditions  that  describe  the  optimal  contract  (when  it  is  not 
a  corner  solution)  are 

F{K)E'{p)  =  KCp  and  (6) 


F'{K)E{p)  =  C  +  KCk.  (7) 

There  is,  however,  relatively  little  that  we  can  say  about  the  optimal 
credit  contract  at  this  level  of  generality.  The  problem  is  easily  seen  from 
equation  5:  An  increase  in  K  affects  both  the  numerator  and  the  denom- 
inator of  the  expression  — (k'-vv)p  '  ^^'^  without  more  structure  it  is  not 
possible  to  say  anything  about  now  more  investment  affects  the  expected 
cost  of  lending. 

The  Model  with  Constant  Returns  in  Monitoring  One  simple  and 
fruitful  way  to  impose  structure  is  to  assmne  constant  returns  in  monitoring, 
i.e., 

M{K,D/K,m)  =  KM{D/K,m). 

For  the  most  part,  we  will  also  assume  that  there  are  constant  returns  in 
production,  i.e.,  F{K)  =  aK.  In  this  case,  equation  5  can  be  rewritten  in 
the  form: 

pR  =  T  +  M{a[E{p{R,a)  -  E{p)]  -  \p{R,a)  - p]R,m). 

R  is  therefore  a  function  of  a,  m,  p  and  F,  and  so  is,  therefore,  the  expected 
cost  of  lending  C.  It  follows  that  keeping  p  and  F  fixed,  doubling  the  bor- 
rower's wealth  and  the  amount  he  invests  does  not  change  the  unit  cost  of 
lending.  It  follows  that  all  borrowers  with  the  same  a  and  the  same  m,  will 
choose  the  same  leverage  ratio  and  face  the  same  interest  rate.  In  other 
words,  under  full  constant  returns  the  rich  and  the  poor  will  pay  the  same 
rate  of  interest  as  long  as  they  are  equally  productive.  The  rich  will  simply 
invest  more. 

The  direct  prediction  of  this  model  is  the  absence  of  a  correlation  between 
the  borrower's  wealth  and  the  interest  rate.  Of  course,  as  we  will  see  later, 
it  does  not  rule  out  the  possibility  of  a  spurious  correlation,  induced  by  a 
correlation  between  W  and  either  a  or  m.  Nevertheless,  this  result  provides 
a  useful  benchmark:  It  tells  us  that  there  is  no  necessary  reason  why  the 
rich  should  pay  lower  interest  rates,  as  observed  in  the  data. 
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The  Model  with  a  Fixed  Cost  of  Monitoring  A  model  that  manages 
to  account  for  most  of  the  observed  fairly  economically  is  one  where  there 
is  a  fixed  cost  of  monitoring  which  has  to  be  paid  as  long  as  there  is  some 
borrowing  and  a  variable  cost  which,  as  before,  exhibits  constant  returns 
i.e., 

M{K,  D/K,  m)  =  KM{D/K,  m)  +  $. 

In  this  case,  equation  5  can  be  rewritten  to  read 

pR  =  T  +  ^/K  +  M{a[E{p{R, a)  -  E{p)]  -  \p{R,  a)  -  p]R,  m).       (8) 

With  these  assumptions,  we  run  the  risk  that  the  lender's  maximization 
problem  may  not  be  convex:  To  see  why,  note  that  T  +  ^/K  =  p{l  — 
W/K)  +  ^/K,  which  tells  us  that  if  $  >  pW ,  R  goes  down  when  K  goes 
up,  encouraging  the  borrower  to  borrow  even  more.  Conversely,  a  borrower 
who  borrows  httle  will  pay  very  high  rates,  making  it  attractive  for  him  to 
not  borrow  at  all,  suggesting  the  possibility  of  a  "bang-bang"  solution.  As 
long  as  we  make  sure  that  a  is  not  too  large  (to  avoid  the  possibihty  that 
the  demand  for  credit  becomes  infinite),  the  solution  will  be  for  the  poor 
borrower  {pW  <<<!>)  to  borrow  nothing. 

However,  there  is  an  interior  solution  for  borrowers  who  are  richer  {pW  » 
$).   It  is  easily  checked  that  this  interior  solution  has  a  very  simple  prop- 
erty:   Prom  equation  8  it  follows  that  as  long  as  p,  a,  p  and  m  are  held 
fixed,  R  is  completely  determined  by  the  term  {pW  —  ^)/K  and  therefore 
C  =  C{P  j^    ).  From  7  the  optimal  choice  of  K  satisfies 

pVF-^        pW~^       pW-^^ 

<y  =  c:(— ^-)  4-  — ^-c  (— ^-) 

in  this  case,  which  tells  us  that  ^  ^  is  uniquely  determined  by  a.  A 
number  of  properties  follow  immediately  from  this  observation.  First,  an 
increase  in  W  results  in  a  more  than  proportional  increase  in  K:  In  other 
words,  richer  people  are  more  leveraged.  Second,  higher  values  of  $  are 
associated  with  a  lower  value  of  K.  Third,  changes  in  W  and  $  do  not 
affect  R,  from  which  it  follows  that  r  goes  down  when  W  goes  up  (since 
K/W  goes  up  and  R  remains  unchanged). 

There  is  a  straightforward  intuition  behind  these  results.  Given  that 
there  is  a  fixed  cost  of  lending,  those  who  invest  more  will  face  a  lower  cost 
of  capital.  However,  for  a  poor  person  to  be  able  to  invest  the  same  amount 
as  a  richer  person,  the  leverage  ratio  would  have  to  be  much  higher  and  this 
distorts  incentives.  The  optimal  contract  balances  these  two  types  of  costs: 
The  poor  end  up  both  investing  less  and  paying  more  in  interest. 

23 


To  assess  whether  the  model  generates  the  right  orders  of  magnitude, 
the  model  was  simulated  under  the  assumption  that  E{p)  =  2p^-^,  and 
M{D/K,m)  =  mD/K.^^  For  parameter  values  p  =  1.05,  m  =  0.8,  $  =  0.5, 
(T  =  0.66,  we  find  that  those  with  wealth  levels  up  to  about  1.75  (i.e.,  about 
three  and  a  half  times  the  fixed  cost  of  monitoring)  do  not  invest  at  all. 
When  investment  begins,  the  interest  rate  is  above  50%  (and  the  leverage 
ratio  is  2.8)  and  as  the  borrower's  wealth  goes  up,  the  interest  rate  goes  down 
and  converges  to  about  27%,  while  the  leverage  ratio  rises  and  converges  to 
about  4.2. 

Property  7.  Wealth  Effects:  When  there  Eire  constant  returns 
in  both  production  and  monitoring,  two  borrowers  who  differ  only 
in  their  wealth  levels  will  be  equally  leveraged  and  will  pay  the 
same  interest  rate.  When  there  is  a  fixed  cost  of  monitoring,  richer 
borrowers  will  pay  a  lower  rate  and  will  be  more  leveraged  and 
the  very  poorest  borrower's  will  prefer  not  to  borrow  at  all. 

These  wealth  effects  have  the  implication  that  the  wealth  advantage  the 
rich  start  with  will  tend  to  get  amplified:  First,  because  the  difference  in  the 
amount  invested  is  typically  going  to  be  larger  than  the  difference  in  wealth. 
This  follows  from  the  fact  that  the  leverage  ratio  is  greater  than  1  and  either 
constant  or  increasing  in  wealth. "^^  Second,  as  long  as  R  is  increasing  in  K, 
the  marginal  product  of  capital  will  be  higher  than  the  expected  interest 
cost  in  equilibrium  and  each  unit  of  investment  generates  some  pure  profits 
(this  follows  from  equation  7).  Since  the  rich  invest  more  than  the  poor, 
they  earn  more  pure  profits. 

3.3     Taking  Stock:  How^  to  Think  about  Credit  mcU-kets 

The  model  with  a  fixed  cost  of  monitoring  gives  us  a  simple  way  of  accounting 
for  the  facts  that  are  listed  above.  Moreover,  it  helps  to  explain  the  fact  that 
short  run  interest  rates  in  informal  markets  are  often  higher  than  longer  run 
interest  rates  (for  example,  see  Table  3.2  in  Dasgupta  (1989)).  Ghate  (1992) 
also  notes  that  very  short  short-term  loans  are  often  particularly  expensive., 
In  the  face  of  it,  this  is  puzzling  because  one  would  imagine  that  the  scope  for 
moral  hazard  is  greater  in  longer  term  contracts.  The  fixed  cost  approach 
can  resolve  this  puzzle  as  long  as  there  is  a  part  of  the  fixed  cost  that  is 

Note  that  pE'{p)  is  increasing  in  p  so  that  it  is  always  optimal  to  choose  the  highest 
possible  value  of  p.  We  set  this  value  to  be  0.9,  so  that  the  default  rate  is  fixed  at  10% 
(which  is  high  but  within  the  observed  range). 

Note  however  that  this  property  may  not  hold  if  all  the  wealth  was  not  liquid.  In  that 
case,  the  leverage  ratio  may  be  less  than  1 . 
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transaction-specific  and  independent  of  tfie  length  of  the  contract. 

However,  not  surprisingly,  this  is  not  the  only  way  to  account  for  these 
facts.  For  example,  if  the  production  function  were  concave  rather  than 
linear,  the  rich  would  be  less  leveraged  than  the  poor  (since  diminishing 
returns  sets  in  at  high  levels  of  investment).  As  a  result,  the  interest  rate 
they  face  will  tend  to  be  lower.  While  they  are  less  leveraged,  the  absolute 
amount  that  the  rich  borrow  may  still  be  higher.  High  levels  of  credit  will 
therefore  be  associated  with  lower  interest  rates.  Similar  patterns  may  arise 
if,  for  example,  it  is  cheaper  to  monitor  the  rich  because,  say,  the  rich  share 
closer  social  ties  with  those  who  are  lending. 

In  any  case,  there  is  no  reason  why  we  should  confine  ourselves  to  mod- 
els of  ex  ante  moral- hazard.  Ex  post  moral  hazard  (borrowers  who  try  to 
avoid  repaying)  is  clearly  an  important  aspect  of  credit  markets  and  so  is 
adverse  selection. ^^  There  are  also  other  ways  to  model  ex  ante  moral  haz- 
ard: Holmstrom-Tirole  (1996)  develop  a  model  where  the  borrower  wants 
to  put  in  less  effort  rather  than  take  too  much  risk.  Many  of  the  basic  pre- 
dictions of  our  model  show  up  in  these  models  as  well.  Not  surprisingly,  less 
leveraged  borrowers,  all  else  being  the  same,  tend  to  be  better  credit  risks 
and  face  lower  interest  rates  at  the  same  level  of  borrowing.  Consequently, 
richer  borrowers  will  have  their  initial  advantage  compounded  by  the  work- 
ings of  the  credit  market.  High  interest  rates,  as  before,  make  the  borrower 
more  likely  to  misbehave  in  these  models  as  well.^"^  This,  in  turn,  tends 
to  raise  the  interest  rate  (either  because  default  becomes  more  likely  or  be- 
cause more  monitoring  is  called  for).  Therefore,  what  I  call  the  multiplier 
property  of  interest  rates — namely  the  fact  that  small  increases  in  the  costs 
of  lending  can  lead  to  a  large  increase  in  the  interest  rate — ought  to  be  true 
in  these  other  models  as  well.  Finally,  while  the  cost  of  monitoring  is  rarely 
formally  introduced  into  these  models  (Holmstrom-Tirole,  1996,  being  an 
important  exception),  it  is  intuitively  clear  that  the  shape  of  the  monitoring 
cost  function  wiU  play  a  crucial  role  and  a  fixed  cost  of  monitoring  will  have 
an  effect  similar  to  the  one  discussed  above. 

One  might  even  want  to  venture  beyond  models  where  the  borrower 
is  the  main  source  of  moral  hazard.  The  paper  by  Hohnstrom  and  Tirole, 
mentioned  above,  worries  about  the  incentives  of  intermediaries  in  the  credit 

'Hart-Moore  (1994)  is  the  best-known  model  of  ex  post  moral  hazard  in  the  credit 
market,  while  Stiglitz  and  Weiss  (1981)  is  the  classic  reference  on  adverse  selection. 

''' Though  as  pointed  out  in  Aghion-Banerjee-Piketty  (1999),  in  models  of  ex  post  moral 
hazard,  there  is  a  possible  countervailing  effect  coming  from  the  fact  that  high  interest 
rates  make  it  more  credible  for  the  lender  to  put  a  lot  of  effort  into  pursuing  recalcitrant 
borrowers. 
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market.  They  argue  that  these  problems  translate  into  a  further  credit 
constraint,  this  time  at  the  level  of  the  intermediary.  Moreover,  the  typical 
intermediary  is  large,  and  if  the  intermediary  itself  has  the  right  incentives, 
its  agents  who  do  the  actual  business  of  lending-the  loan  officers  and  credit 
appraisers  of  the  world-may  not,  since  what  they  have  at  stake  personally 
is  only  a  very  small  part  of  the  amount  of  money  they  control.  The  solution 
is  typically  to  restrict  the  domain  they  control:  Stringent  bureaucrat  rules 
about  what  loan  officers  can  and  cannot  do,  are  a  feature  of  credit  market 
intermediaries  the  world  over.  This,  of  course,  comes  at  a  cost.  Credit 
decisions  become  bureaucratized  and  typically  much  less  responsive  to  all 
but  the  hardest  information.  Assessments  of  the  quality  of  the  project  and 
judgements  about  future  profitability,  both  relatively  soft  information,  will 
tend  to  have  little  impact  on  who  gets  credit.  To  the  extent  that  other 
institutions,  such  as  venture  capitalists,  do  not  pick  up  the  slack,  this  will 
hurt  new  entrants  and  the  most  radical  ideas.  This  problem  may  also  be 
most  serious  where  the  banking  sector  is  dominated  by  the  public  sector, 
given  that  there  is  already  a  tendency  towards  bureaucratization  and  buck- 
passmg."*^ 

Of  course  much  more  work  (simulations,  etc.)  is  called  for  before  we 
can  be  sure  that  these  alternative  theories  can  generate  the  right  orders  of 
magnitude.  But  in  one  sense  there  is  no  reason  to  pose  these  as  alternatives. 
All  of  them  working  together  can  generate  a  larger  aggregate  effect,  and 
large  aggregate  effects  are  clearly  important  in  giving  relevance  to  this  class 
of  theories.  However,  from  the  point  of  view  of  actually  designing  policy  it 
is  important  to  know  exactly  where  the  constraint  lies.  It  is  also,  both  from 
the  point  of  view  of  macro-relevance  and  micro-pohcy  design,  important 
to  identify  the  exact  structure  of  the  credit  constraint.  Is  the  amount  of 
credit  that  a  borrower  can  get  primarily  a  function  of  the  borrower's  wealth 
and  his  expected  profitability,  as  our  model  suggests,  or  is  it  the  case  that 
profitability  is  largely  ignored,  as  the  simple  bureaucratic  model  sketched 
above  would  suggest?  How  important  is  the  distinction,  ignored  in  our 
analysis  between  wealth  and  liquid  wealth — one  additional  reason  why  the 
poor  may  suffer  is  that  their  wealth  may  be  less  liquid?^^  Is  the  borrower's 

'^For  evidence  that  this  is  a  very  real  problem,  see  Banerjee  and  Duflo  (2001).  There  are 
also  lots  of  anecdotes  that  support  this  view:  It  is  said,  for  example,  that  Indian  bankers 
in  the  1980s  and  early  1990s  were  puzzled  by  how  they  could  justify  lending  to  software 
companies,  since  their  only  real  assets  were  their  staff  and  work-in-progess  was  lines  of 
code  on  the  computer. 

^^The  work  by  Swaminathan,  already  cited,  suggests  that  this  may  indeed  be  an  im- 
portant distinction.    The  requirement  that  the  wealth  be  held  as  liquid  collateral  also 
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wealth  the  key  ingredient  or  is  it  his  inventories,  as  in  the  classic  models 
of  inventory  financing?'^^  The  answer  to  each  of  these  questions  has  many 
important  ramifications,  and  careful  empirical  research  on  the  technology  of 
lending  remains  one  of  the  imperatives  of  the  day. 

Theme  1:  The  observed  patterns  in  credit  markets — low^  de- 
fault rates,  high  and  highly  variable  interest  rates  and  credit  limits 
that  increase  with  wealth — suggest  that  contracting  in  credit  mar- 
kets is  highly  imperfect  and  monitoring  is  very  important.  This 
suggests  that  there  will  be  under-investment,  and  a  significant 
part  of  the  output  produced  will  be  wasted  on  monitoring.  The 
earnings  gap  between  the  rich  and  the  poor  will  be  amplified  by 
the  capital  market  imperfection  and  this  will  be  particularly  the 
case  if,  as  seems  plausible,  there  are  fixed  costs  of  monitoring. 

4     The  Dynamics  of  Wealth  Accumulation 

One  important  message  from  the  previous  section  is  that  the  poor  are  at 
a  disadvantage  in  the  credit  market.  However,  as  we  have  seen,  the  exact 
form  of  the  disadvantage  tends  to  depend  on  the  technologies  of  production 
and  monitoring.  For  example,  in  the  case  where  there  are  constant  returns 
in  both  production  and  monitoring,  the  disadvantage  takes  the  form  of  a 
proportional  reduction  in  the  amount  they  invest  with  no  difference  in  the 
interest  rate  or  the  choice  of  projects,  whereas  a  model  with  a  fixed  cost 
of  monitoring  generates  variations  in  the  interest  rate  and  in  the  choice 
of  projects.  In  the  short  run,  however,  aU  versions  of  the  capital  market 
imperfection  have  the  common  implication  that  the  poor  will  be  under- 
rewarded  for  their  talent. 

The  longer  run  implications  of  the  different  models  are,  however,  poten- 
tially very  different.  To  see  this,  imagine  a  world  where  there  is  one  good 
produced  and  a  population  of  identical  people  who  each  live  for  one  period 
and  always  have  one  child.  Each  person  starts  hfe  with  an  endowment  that 
her  parent  gave  her.  Her  life  is  simple,  verging  on  the  drab.  At  the  beginning 
of  her  life,  she  chooses  among  income  earning  opportunities.  Her  choices  vidll 
be  to  either  invest  in  a  productive  opportunity  or  to  lend  out  the  money. 

creates  a  demand  for  coUateralizable  assets  and  shifts  in  the  relative  price  of  these  assets 
become  important  (see  Kiyotaki  and  Moore,  1997,  for  a  macroeconomic  model  based  on 
this  particular  relative  price  shift). 

"Based  presumably  on  the  idea  that  loans  have  to  be  fully  collateralized  and  inventories 
can  serve  as  collateral.  Clearly  the  plausibility  of  this  model  depends  on  how  easy  it  is  to 
attach  inventories. 
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The  exact  technology  of  production  will  be  discussed  later,  but  it  could  be 
thought  of  either  as  investing  in  learning  a  skill,  starting  a  business  or  even 
patenting  or  promoting  a  new  idea.  At  the  end  of  the  period,  she  decides 
on  what  to  do  with  her  realized  income,  which  consists  of  her  investment 
earnings  plus  an  endowment  e  -  she  can  leave  it  to  her  child  or  eat  it  her- 
self. For  simplicity,  assume  that  she  has  Cobb-Douglas-like  preferences  over 
consumption  (c)  and  bequest  (6): 

U{c,b)  =  A[c^~^l/],0  <P<1,A>0. 

Since  she  allocates  her  end-of-period  wealth  between  these  two  uses,  this 
immediately  implies  that  if  her  end-of-period  income  (or  wealth)  is  y: 

c  =  (1  -  P)y,         b  =  (5y. 

People  borrow  to  invest  more  than  what  they  were  born  with.  Assume  that 
the  credit  market  is  exactly  as  modeled  in  the  previous  section. 

These  assumptions  together  define  a  simple  dynamic  process  that  maps 
Wf ,  the  wealth  of  an  individual  from  the  current  generation  (which  is  equal 
to  the  bequest  he  received),  into  the  wealth  of  his  child,  W(+i.  The  exact 
shape  of  this  map  will  depend  on  what  we  assume  about  technology,  which 
is  what  we  turn  to  now. 

First  consider  the  case  where  both  the  production  technology  and  the 
monitoring  technology  are  fully  linear.  In  this  case  we  know  that  the  optimal 
leverage  ratio,  the  optimal  level  of  monitoring,  and  the  optimal  choice  of  p 
are  all  independent  of  the  investor's  wealth.  To  simplify  life,  assume  that 
the  optimal  choice  of  p  is  1.  Denote  the  monitoring  cost  required  to  sustain 
this  by  m  (per  dollar  lent).  Finally,  assume  that  the  production  technology 
takes  the  form  F{K)  =  E{l)K  and  that  the  optimal  leverage  ratio  is  A.  In 
this  case,  the  map  from  current  wealth  to  future  wealth  is  given  by 

Wt+i  =  f3[e  +  KWt], 

where  K.  =  E{1)X- {X-l)[p  +  m). 

In  the  case  where  /?«  <  1,  this  is  a  process  that  converges  to  a  wealth 
level  (5e/[l  —  (3k):  Every  dynasty  ends  up  with  the  same  wealth  in  the  long 
run.  On  the  other  hand,  in  the  case  where  (3k  >  1  no  dynasty  ever  converges 
but  in  the  long  run  the  wealth  of  every  dynasty  grows  at  the  same  rate  and 
everyone  becomes  extremely  wealthy.  There  are  no  poverty  traps  in  this 
model.'**  The  dynamics  of  wealth  in  this  model  are  shown  in  Figure  1. 

However,  this  model  is  consistent  with  divergence  across  countries:  Those  countries 
that  have  a  better  financial  system,  and  therefore  a  lower  m,  will  grow  faster. 
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Insert  Figure  1  here. 

The  model  behaves  in  much  the  same  way  when  the  production  and 
monitoring  technologies  are  strictly  convex  (i.e.,  both  functions  are  strictly 
concave).  A  simple  way  to  see  this  is  to  note  that  the  poor  always  have  the 
option  of  choosing  the  same  leverage  ratio  as  the  rich.  If  they  were  to  choose 
the  same  leverage  ratio,  they  would  actually  pay  less  in  interest  than  the 
rich,  since  they  have  a  higher  marginal  product  of  capital  and  monitoring 
them  is  easier  (diminishing  returns  in  monitoring).  Their  net  return  per 
dollar  of  their  wealth  would  therefore  be  higher  and  as  a  result,  the  wealth 
of  the  poor  will  grow  faster  than  that  of  the  rich.^^ 

Things  change  significantly  when  at  least  one  of  the  technologies  stops 
being  convex.  To  take  an  example,  assume  that  the  production  technology 
remains  what  it  was  in  the  previous  example  but  introduce  a  fixed  cost  of 
monitoring.  Under  these  assumptions  the  equation  representing  the  evolu- 
tion of  wealth  for  those  who  can  borrow  can  be  written  in  the  form: 


Wt+i  =  P[e  +  E{l)Kt  -  {Kt  -  Wt)r],^  (3e  +  i3\{Wt){E{l)  -  R). 

As  shown  in  the  previous  section,  when  there  is  a  fixed  cost  of  monitoring 
R  is  constant  but  A  is  an  increasing  function  of  wealth  and  converges  to  a 
A  as  wealth  continues  to  go  up.  For  wealth  below  some  W* ,  people  do  not 
borrow  and  their  wealth  evolves  according  to 

Wt+i=Pe  +  pWtE{l). 

It  is  easy  to  see  that  this  generates  a  map  which  has  the  convex  shape  in 
figure  2. 

Insert  Figure  2  here 

The  wealth  dynamics  implied  by  this  picture  are  very  different  from 
those  generated  by  the  linear  model._The  rich  get  richer  all  the  time  but  the 
poor  converge  to  a  steady  state  at  W.  However,  it  is  not  neccessary  that 
inequality  grows  without  a  bound:  It  is  straightforward  to  add  to  this  model 

'"This  claim  does  depend  in  an  important  way  on  the  fact  that  there  is  only  one  type 
of  investment.  Mookherjee  and  Ray  (2000)  show  that  if  there  are  many  alternative  types 
of  investment  and  the  economy  needs  all  of  them,  long  run  inequality  may  be  inevitable 
in  a  world  of  imperfect  capital  markets,  even  if  all  the  individual  technologies  are  convex 
and  people  are  forward-looking  in  their  savings  decisions. 
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a  technological  limit  on  investment,  K.  It  is  easily  checked  that  what  tliis 
does  is  to  reduce  the  amount  borrowed  by  those  who  would  have  otherwise 
invested  more  than  K.  Eventually,  when  someone's  wealth  exceeds  K,  they 
will  stop  borrowing  and  start  lending,  which  causes  the  Wt+i  schedule  to 
take  the  form 

Wt+i  =  0[e  +  E{1)K  +  {Wt  -  K)p]. 

As  long  as  Pp  <  1,  this  implies  that  wealth  will  remain  bounded:  By  the  fact 
that  the  curve  is  continuous,  it  follows  that  it  must  be  S-shaped.  Depending 
on  the  parameters,  this  model  has  either  one  or  three  steady  states,  of  which 
the  two  extreme  ones  are  stable.  When  there  are  two  stable  steady  states, 
the  poor  converge  to  the  lower  one  and  the  rich  to  the  upper,  reflecting  the 
fact,  noted  above,  that  when  there  is  a  fixed  cost  of  monitoring  the  poor 
earn  a  lower  return  on  their  wealth. 

Insert  Figure  3  here. 

This  is  the  classic  poverty  trap  situation:  The  poor  remain  poor  and 
the  rich  stay  rich.  Versions  of  this  story  have  been  told  in  many  papers, 
including  Galor  and  Zeira  (1993),  Dasgupta  and  Ray  (1986),  and  Banerjee 
and  Newman  (1994),  though  they  each  have  a  different  name  for  the  in- 
vestment: education  in  Galor-Zeira;  health  in  Dasgupta-Ray;  and  capital  in 
Banerjee-Newman.  Both  Galor-Zeira  and  Dasgupta-Ray  have  exphcit  non- 
convexities  in  the  production  function  while  Banerjee-Newman  introduces 
the  non-convexity  into  the  technology  of  lending.  Moav  (1999)  presents  a 
slightly  different  version  of  the  argument  which  relies  on  non-convexities  in 
what  one  might  call  the  bequest  technology. '*'' 

Of  course  there  are  good  reasons  not  to  take  poverty  traps  literally.  The 
very  lucky  and  the  very  talented  among  the  poor  will  probably  manage 
to  escape  their  background,  and  some  of  the  rich  will  surely  manage  to 
squander  their  patrimony.  The  robust  imphcation  of  this  model  is  rather 
that  economic  mobility  will  be  slow.  An  economy  that  starts  with  a  lot 
of  poor  people  will  remain  both  poor  and  unequal  for  a  very  long  time. 
This,  however,  raises  the  question  of  whether,  in  the  end,  this  model  is  very 
different  from  the  model  where  all  the  technologies  are  convex  and  there  is 
no  poverty  trap,  but  the  capital  markets  are  extremely  inefficient — after  all 

''  In  terms  of  our  notation,  the  assumption  he  makes  is  that  the  amount  bequeathed  is 
a  strictly  convex  function  of  end-of-period  income  over  some  range,  whereas  we  have  so 
far  assumed  that  it  is  a  hnear  function.  It  is  easy  to  see  that  this  can  generate  a  poverty 
trap-essentially  the  rich  benefit  from  the  fact  that  rich  parents  leave  bequests  that  are 
disproportionately  large  relative  to  their  wealth. 
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when  capital  markets  do  not  work  very  well,  convergence  can  take  a  very 
long  time. 

My  view  is  that  it  is  nevertheless  worth  distinguishing  between  these  two 
models  because  the  forms  of  mobihty  that  they  permit  are  rather  different. 
The  convex  model  predicts  a  slow  and  steady  rise  for  all  the  poor  which 
culminates  in  their  catching  up  with  the  rich  (or  if  there  is  no  convergence, 
they  still  become  very,  very  rich).  The  mobility  in  the  model  with  non- 
convexities,  by  contrast,  comes  from  those  who  are  either  very  talented  or 
very  lucky.  In  other  words,  it  takes  the  form  of  large  jumps  by  a  relatively 
few  people.'*^  This  difference  also  shows  up  in  the  shape  of  the  long  run 
distributions.  In  the  convex  model,  most  people  in  the  long  run  will  be 
"middle-class"  with  some  outliers  who  are  either  very  lucky  or  very  unlucky. 
By  contrast,  in  the  other  model  most  people  will  either  be  rich  or  poor. 

Theme  2:  Models  where  all  the  key  technologies — the  pro- 
duction technology,  the  monitoring  technology  and  the  bequest 
technology — are  all  linear  or  convex  tend  to  favor  long  run  conver- 
gence: Those  of  comparable  talent  will  earn  comparable  amounts 
in  the  long  run  and  the  long  run  distribution  of  income  will  re- 
flect the  distribution  of  abilities  in  the  population.  By  contrast, 
models  where  at  least  one  of  these  technologies  is  non-convex  can 
generate  poverty  traps  at  the  level  of  the  individual.  People  who 
start  poor  stay  poor,  and  therefore  equally  able  people  may  earn 
very  different  cimounts  even  in  the  long  run. 

4.0.1      Endogenous  Savings  and  Poverty  Traps 

The  one  obviously  unsatisfactory  aspect  of  our  model  so  far  is  the  modeling 
of  bequest  decisions.  While  the  way  we  have  modeled  them  is  extremely 
convenient  and  there  is  no  good  reason,  either  empirical  or  a  priori,  to 
switch  to  full  "Barro"  preferences,  there  are  clearly  cases  where  our  model 
seems  a  bit  strained.  In  the  specific  example  in  the  previous  section  where 
monitoring  has  a  fixed  cost,  the  rate  of  return  on  beginning-of-period  wealth 
(i.e.,  bequests)  varies  enormously  and  those  who  start  with  more  will,  over 
a  range,  earn  a  higher  return.  It  is  therefore  plausible  that  parents  will  take 
this  into  account  when  planning  their  bequests. 

To  see  what  changes  when  we  allow  for  endogenous  savings  decisions, 

■"Paulson  and  Townsend  (2000)  and  Jeong  and  Townsend  (2000)  make  a  related  but 
different  distinction  between  models  (such  as  Evans  and  Jovanovic,  1989)  where  poor  cap- 
ital markets  hurt  the  most  talented  people  and  models  (such  as  Lloyd-Ellis  and  Bernhardt, 
2000)  where  poor  capital  markets  hurt  most  the  least  talented. 
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consider  a  modification  of  our  basic  model  that  makes  the  agents  infinitely- 
hved  and  endows  them  with  the  standard  forward-looking  preferences.  In 
the  case  where  the  production  technology  is  convex  and  credit  markets  are 
absent,  this  is  a  special  case  of  the  model  studied  by  Loury  (1981)  where  he 
showed  that  there  is  convergence  despite  the  credit  markets  being  absent. 
Indeed,  as  emphasized  by  Caselli  and  Ventura  (1996),  the  presence  of  effec- 
tive credit  markets  in  such  an  environment  may  actually  slow  or  even  stop 
convergence.  The  point  is  that  poor  capital  markets  can  act  as  a  spur  to 
savings,  since  they  make  it  more  important  to  have  one's  own  wealth  and 
this  effect  is  strongest  for  the  poor,  since  capital  is  most  productive  in  their 
hands.  By  contrast,  when  capital  markets  are  perfect  the  marginal  product 
of  capital  is  equalized  everywhere  and  the  poor  have  no  more  incentive  to 
save  than  the  rich.''" 

When  we  combine  this  model  of  savings  with  a  non-convex  monitoring 
technology,  things  change  dramatically.  As  noted  above,  the  rate  of  retiurn 
on  savings  is  now  lower  for  the  poor  than  for  those  who  are  somewhat  richer. 
This  is  especially  true  of  the  very  poor,  who  cannot  borrow  at  all.  As  a 
result,  the  rich  (or  at  least  the  middle  classes,  since  the  very  rich  also  earn 
low  returns  on  their  savings)  will  save  a  higher  proportion  of  their  income 
than  the  poor,  which  reinforces  the  poverty  trap.''^ 

Theme  3:  If  savings  decisions  are  based  on  future  benefits, 
capital  market  imperfections  can  actually  promote  savings  and 
convergence.  However,  if  the  technology  of  production  or  moni- 
toring is  non-convex,  the  encoviragement  efTect  may  only  operate 
on  the  relatively  wccdthy.  The  poorest  get  low  retm-ns  from  cap- 
ital and  therefore  will  save  relatively  little,  which  may  reinforce 
the  poverty  trap. 

4.0.2      Endogenous  Prices  and  Collective  Poverty  Traps 

One  implication  of  there  being  a  poverty  trap  at  the  level  of  the  individual 
is  that  there  is  also  a  collective  poverty  trap:  an  economy  that  starts  with 
a  lot  of  poverty  will  end  up  with  a  lot  of  poverty.  Collective  poverty  traps 
can,  however,  exist  in  models  with  imperfect  credit  markets,  even  when  there 
are  no  individual-level  poverty  traps.    This  point,  first  noted  by  Banerjee 

As  pointed  out  by  Ghatak,  Morelli  and  Sjostrom  (2000),  it  can  also  act  as  a  spur  to 
hard  work,  ais  a  way  to  accumulate  capital. 

This  is  strictly  only  true  if  we  assume  that  the  average  savings  rate  for  the  economy 
is  still  /5,  but  now  the  poor  and  the  rich  have  different  savings  rates. 
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and  Newman  (1993),'^'*  relies  on  the  fact  that  in  a  world  where  people  are 
credit-constrained,  factor  prices  depend  on  the  wealth  distribution  (since 
the  demand  for  factors  depends  on  who  has  how  much  wealth).  But  the 
wealth  distribution  in  any  economy  depends  on  factor  prices — this  two-way 
interaction  creates  the  possibility  of  multiple  steady  states. 

To  see  exactly  how  this  might  happen,  consider  a  variant  of  the  model 
developed  above  with  linear  production  and  monitoring  technologies.  The 
one  factor  price  in  that  model  was  the  cost  of  capital,  which,  so  far,  we 
took  as  given.  To  endogenize  the  interest  rate,  assume  that  the  supply  of 
capital  comes  from  a  fixed  fraction  of  the  population  who  cannot  invest 
in  the  linear  technology.  Assume  that  in  every  generation,  a  fraction  ^ 
are  handicapped  in  this  way,  but  tliis  attribute  is  neither  correlated  over 
time  nor  correlated  with  their  wealth.  These  people  (and  everyone  else) 
do  have  an  alternative  investment  possibility  which,  for  want  of  a  better 
description,  we  will  call  "land":  The  return  from  investing  in  land  is  given 
by  a  strictly  concave  production  function  H{iJ.K),  where  K  is  the  average 
amount  invested  in  land  by  each  investor  who  invests  in  land.  Since  this  is 
a  completely  safe  investment  it  will  earn  the  safe  rate,  p,  on  the  marginal 
unit,  i.e.,  p  =  H'{^iK).  This  allows  us  to  write  p  =  p{K),  p'  <  0.  The  rest  of 
the  available  wealth  in  the  economy  will  be  invested  in  the  linear  production 
technology.  In  other  words,  if  W  is  the  per  capita  wealth  in  the  economy, 
and  if  K  denotes  the  average  amount  that  each  investor  puts  into  the  hnear 
production  technology,  the  market  clearing  condition  for  the  credit  market 
will  be:  _ 

fik  =  W-{l-p)K  =  W-{l-p)X{p{K))W,  (9) 

where  X{p)  is  the  optimal  leverage  ratio  from  the  point  of  view  of  borrowers 
in  an  economy  where  the  cost  of  capital  is  p. 

How  about  the  evolution  of  wealth  in  this  economy?  Given  all  the  as- 
sumption that  have  been  already  made,  this  turns  out  to  be  quite  straight- 
forward: 

Wt+i  =  P[e  +  H{pK{Wt))  +  {Wt  -  fxK{Wt)){aE{pip{K)))  -  M{p{k)))], 

where  'p{p)  is  the  optimal  choice  of  p  when  the  cost  of  capital  is  p  and 
M{p)  is  the  corresponding  level  of  monitoring  per  unit  of  capital.  Prom 
equation  9,  when  W  goes  up,  K  also  goes  up,  but  less  than  proportionally 
since  p  goes  down  and  A  goes  up.  In  other  words,  as  the  economy  gets 
richer,  a  higher  and  higher  proportion  of  its  wealth  will  be  invested  in  the 

■'■'There  can  also  be  collective  poverty  traps  in  models  where  there  are  no  credit  market 
imperfections,  if,  for  example,  there  are  peer  group  externalities  (see  Durlauf,  1996). 
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linear  production  technology.  Since  the  net  return  to  the  linear  production 
technology,  crE{p{p))  —  M{p),  is  greater  than  p  (from  equation  7)  the  average 
return  on  capital  may  go  up  as  a  result  of  the  shift  between  the  two  sectors. 
This  tells  us  that  the  iyf+i(H^f)  map  need  not  be  concave.  In  particular, 
it  can  have  the  S-shape  depicted  in  Figure  4.  This  is  most  likely  if  it  is  the 
case  that  at  low  levels  of  Wt  most  of  the  capital  is  invested  in  land,  but 
when  Wt  goes  up  beyond  a  certain  point,  the  marginal  product  of  capital 
invested  in  land  falls  off  very  quickly  and,  as  a  result,  all  additional  capital 
is  allocated  to  the  alternative  technology. 

The  S-shape  in  Figure  4  generates  what  one  might  call  a  collective 
poverty  trap.  No  individual  in  this  economy  is  ever  trapped  in  poverty 
since  all  the  technologies  are  convex.  Nevertheless,  economies  that  start 
poor  stay  poor.  This  is  because  capital  is  expensive  in  these  economies  and 
we  know  from  above  that  when  capital  is  expensive  it  is  more  likely  to  be 
used  wastefuUy  (our  Property  1). 

This  argument  is  an  instance  of  a  very  general  point:  In  all  of  these 
models,  the  distribution  of  wealth  determines  the  pattern  of  investment, 
which  in  turn  determines  the  demand  for  various  factors,  the  factor  prices, 
and  eventually  the  next  period's  distribution  of  wealth.  Since  the  effect  of 
demand  on  factor  prices  is  usually  non-linear,  the  map  from  the  present 
distribution  of  wealth  to  the  future  distribution  so  generated  will  typically 
also  be  highly  non-linear,  and  therefore  there  is  no  reason  to  expect  unique 
steady  states  in  dynamic  models  with  endogenous  price  determination.^^ 
Indeed,  there  is  no  presumption  that  these  dynamic  processes  necessarily 
converge  to  a  steady  state:  Aghion,  Banerjee  and  Piketty  (1999)  and  Aghion, 
Bacchetta  and  Banerjee  (1999)  generate  endogenous  cycles  from  models  of 
this  class. 

This  type  of  argument  can  also  be  made  using  other  prices:  Banerjee  and 
Newman  (1993),  who  were  the  first  to  emphasize  this  type  of  multiplicity, 
relied  on  the  endogeneity  of  wages.  High  wage  economies,  in  their  argument, 
allow  the  children  of  the  poor  to  become  entrepreneurs,  with  the  result  that 
the  demand  for  labor  and  wages  remains  high. 

What  is  key  in  all  of  these  cases  is  a  perverse  price  effect:  In  the  presence 
of  capital  market  imperfections,  price  changes  can  have  powerful  wealth 
effects  which  make  it  possible  that  an  increase  in  the  price  actually  leads 
to  an  increase  in  the  excess  demand.  Thus,  in  the  just-mentioned  paper  by 
Banerjee  and  Newman  an  increase  in  the  wage  raises  the  demand  for  capital, 
while  in  the  example  represented  in  Figure  4,  a  rise  in  the  interest  rate  raises 

■"^For  a  more  elaborate  discussion  of  this  point,  see  Banerjee  and  Newman  (1993). 
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the  excess  demand  for  capital. 

Perverse  price  effects  are,  however,  no  guarantee  that  global  convergence 
will  fail.  Aghion  and  Bolton  (1997),  who  were  the  first  to  analyze  a  dynamic 
model  of  exactly  this  class  (i.e.,  with  credit  market  imperfections  and  an 
endogenous  interest  rate),  had  focussed  on  the  case  where  there  was  global 
convergence.  Their  point  was  that  the  process  of  convergence  involved  a 
Kuznets-like  sequence  of  increasing  and  then  decreasing  inequality.  Our 
example  above  is  inspired  by  the  work  of  Piketty  (1997)  who  showed  that  the 
Aghion-Bolton  model  can,  under  suitable  parametric  assumptions,  generate 
multiple  steady  states.^^  The  key  difference  seems  to  come  from  assumptions 
about  the  elasticity  of  demand  for  capital:  In  Aghion-Bolton  the  demand 
for  capital  is  relatively  elastic  and  therefore  there  are  no  sharp  changes  in 
the  interest  rate.  As  a  result,  the  economy  is  always  quite  stable."*^ 

Theme  4:  There  caji  be  collective  poverty  traps  even  when 
there  Eire  no  individual  poverty  traps.  This  is  most  likely  to  be 
the  case  if  the  supply  and  demand  for  factors  are  not  too  price- 
elastic. 

4.1      Taking  Stock:  How  Plausible  Are  Poverty  Traps? 

There  are  two  parts  to  the  answer  to  this  question.  First,  are  the  assump- 
tions of  the  poverty  trap  model  plausible?  Second,  do  the  implications  of 
the  model  correspond  to  something  we  observe?  These  are  addressed  one 
by  one. 

One  key  assumption  of  the  model  of  the  poverty  trap  at  the  level  of  an 
individual  is  a  non-convexity  in  either  the  production  function,  the  moni- 
toring fimction  or  the  bequest  function.  Of  the  three,  non-convexities  in  the 
bequest  function  are  perhaps  the  easiest  to  document.  Empirical  evidence 

■"^In  Piketty's  example  (as  in  our's),  multiple  steady  states  arise  because  high  inter- 
est rates  are  inefficient  and  this  inefficiency  reduces  the  supply  of  capital,  and  therefore 
the  interest  rate  goes  up.  Interestingly,  Matsuyama  (2000)  who  provides  an  alternative 
argument  for  multiple  steady  states  in  this  class  of  models,  reUes  on  an  argument  that 
associates  high  interest  rates  with  efficient  steady  states.  Raising  the  interest  rate  in  his 
model  allows  the  poor  to  accumulate  wealth  faster  (since  they  tend  to  be  lenders)  and 
this  increased  wealth  allows  more  poor  people  to  transition  faster  into  being  investors, 
raising  the  demand  for  capital  and  the  interest  rate.  In  other  words,  while  both  Piketty 
and  Matsuyama  build  their  argument  on  the  fact  that  in  this  type  of  model  the  excess 
demand  for  capital  can  go  up  when  the  interest  rate  goes  up,  in  Matsuyama  this  happens 
because  the  demand  for  capital  goes  up  while  in  Piketty  it  is  the  supply  that  goes  down. 

■'^ While  high  interest  rates  are  typically  associated  with  underdevelopment — which  is 
what  Piketty's  model  tells  us — neither  of  these  models  is  intended  to  be  "taken  to  the 
data" . 
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from  many  OECD  countries  supports  the  view  that  bequests  are  a  luxury 
good.  Only  the  richer  people  leave  bequests  of  any  significant  size  so  that 
the  bequest  function,  at  least  over  a  range,  is  clearly  convex. 

Non-convexities  in  production  are  certainly  very  plausible.  Most  ma- 
chines have  a  minimum  efficient  scale,  and  while  rental  markets  can  amelio- 
rate this  problem,  they  provide,  at  best,  an  imperfect  substitute.''^  In  the 
case  where  the  good  being  produced  is  a  usable  education,  there  are  several 
potential  sources  of  non-convexities:  Learning  the  letters  of  the  alphabet 
is  probably  useful  only  when  it  translates  into  the  ability  to  read  simple 
sentences.  For  this  reason,  the  first  few  years  of  education  may  not  gener- 
ate any  returns,  unless  instruction  is  continued  further.  This  is  what  Card 
and  Krueger  (1992)  find  in  U.S.  data:  The  first  five  years  of  education  have 
no  direct  return.  In  the  U.S.,  of  course,  very  few  people  plan  to  get  less 
than  five  years  of  education,  but  for  some  developing  countries  this  may  be 
an  important  non-convexity.  Angrist  and  Krueger  (1999),  once  again  using 
U.S.  data,  find  significant  jumps  in  the  return  to  education  at  school  com- 
pletion and  college  completion.  Case  and  Deaton  (1999)  also  report  that 
the  relation  between  wages  and  years  of  schooling  for  black  South  Africans 
starts  relatively  flat  and  becomes  steeper.  However  Psacharopoulos  (1994), 
concludes  in  favor  of  concavity,  based  on  range  of  studies  from  all  over  the 
world  and  Duflo  (2001)  finds  an  essentially  linear  relationship  based  on  data 
from  Indonesia.  Assesing  the  relative  merits  of  these  studies  is  beyond  the 
scope  of  this  paper  but  it  is  clear  that  the  definitive  paper  on  the  important 
issue  of  non-convexity  in  education  is  yet  to  be  written.  Note  however  that 
in  any  case  the  fact  that  there  a  non-convexity  is  only  interesting  if  it  is 
large  enough  to  be  a  hurdle  for  a  significant  number  of  people:  Ideally,  we 
would  like  evidence  that  allows  us  to  scale  the  wealth  distribution  among 
potential  investors  in  an  industry  to  the  size  of  the  non-convexity  in  that 
industry.  This  would  allow  us  to  answer  questions  such  as:  What  fraction 
of  the  population  of  potential  investors  in  this  industry  are  sufficiently  poor 
that  the  non-convexity  is  relevant  for  them? 

The  situation  in  the  case  of  the  monitoring  technology  is  even  worse 
because  very  little  is  actually  known.  Aleem  only  reports  the  average  mon- 
itoring cost  for  the  entire  population,  which  does  not  tell  us  how  the  cost 
changes  with  the  amount  borrowed.  There  are  certainly  a  priori  good  rea- 
sons to  suspect  that  some  part  of  the  cost — such  as  the  cost  of  meeting  the 
first  time  with  the  potential  borrower — is  a  fixed  cost.  Moreover,  as  already 
discussed,  fixed  costs  provide  a  natural  explanation  of  the  observed  patterns 

Machines  that  require  careful  maintenance  are  typically  not  available  for  rent. 
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in  credit  markets.  But  we  certainly  need  direct  evidence  about  the  size  of 
the  costs  that  are  incurred  by  the  lender  at  difFerent  stages  of  his  relationship 
with  borrowers  and  how  that  varies  across  borrowers. 

There  is  also  an  a  priori  argument  against  non-convexities.  Lehnert 
(1998)  has  argued  that  wherever  there  are  non-convexities,  people  should 
participate  in  lotteries,  which  will  make  all  of  them  better  off  and  elimi- 
nate both  the  non-convexity  and  the  poverty  trap.  This  is  not  as  fanciful 
as  it  might  seem.  Many  poor  people  throughout  the  world  participate  in 
ROSCAs,  which,  it  has  been  argued,  are  a  type  of  private  lottery  designed 
to  deal  with  non-convexities."^^  However,  ROSCAs  typically  have  the  fea- 
ture that  all  participants  get  to  "win"  over  a  relatively  short  period  of  time 
(at  most  a  year)  which  is  feasible  when  the  non-convexity  is  of  the  order  of 
magnitude  of  a  year's  savings  for  the  average  participant  but  not  for  larger 
non-convexities.  This  may  reflect  the  fact  that  people  are  unwilling  to  lose 
a  large  amount  of  money  in  a  single  lottery,  perhaps  because  there  is  loss- 
aversion  or  regret  in  their  preferences.  Moreover,  the  argument  only  applies 
to  the  non-convexities  in  production  and  monitoring.  What  is  described  as 
a  non-convexity  in  the  bequest  technology  is  actually  a  simple  non-hnearity 
and  is  not  subject  to  this  criticism. 

There  is  also  the  possibility  that  as  the  economy  becomes  richer  and 
richer  (through  a  process  of  technological  upgrading,  for  example),  the  non- 
convexities  will  become  less  and  less  important.  For  this  to  be  true,  the 
level  of  the  non-convexity  must  grow  less  fast  than  the  income  of  the  average 
person:  This  is  unlikely  to  be  true  if,  for  example,  the  non-convex  cost  is  a 
labor  cost  (the  cost  of  monitoring,  for  example).  Moreover,  the  process  of 
technological  upgrading  is  often  accompanied  by  an  increase  in  the  size  of 
the  fixed  cost. 

Non-convexities,  by  themselves,  do  not  guarantee  that  there  is  a  poverty 
trap.  What  we  need  for  an  individual  level  poverty  trap,  to  put  it  crudely,  is 
evidence  that  the  map  from  current  wealth  to  future  wealth  is  steep  enough 
to  cut  the  45°  line  more  than  once.  This  can  be  a  tough  criterion  to  meet. 
For  example,  Dasgupta  (1993)  has  argued,  based  on  evidence  from  studies  by 
biologists  and  nutritionists  of  the  effects  of  malnutrition,  that  the  relation 
between  the  parent's  health  (his  proxy  for  wealth)  and  the  child's  health 
tends  to  be  highly  non-linear  and  typically  includes  a  non-convex  section. 
While  this  is  important  evidence,  it  is  not  enough  to  establish  the  existence 
of  a  poverty  trap  as  interpreted  here.^*^   The  problem  is  that  the  available 

■"^See  Besley,  Coate  and  Loury  (1993). 

''"Dasgupta  (1997)  suggests  a  more  inclusive  definition  of  a  poverty  trap  in  response  to 
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estimates  of  the  elasticity  of  income  with  respect  to  nutrition  and  health, 
as  well  as  the  elasticity  of  nutrition  with  respect  to  income  tend  to  be  less 
than  1,^^  suggesting  that  there  cannot  be  an  individual  level  poverty  trap.^^ 
On  the  other  hand,  it  can  be  argued  that  this  is  of  course  only  one  of 
many  mechanisms  that  go  into  the  map  from  current  to  future  wealth.  The 
question  then  is  whether  these  mechanisms  tend  to  reinforce  each  other 
or  cancel  each  other  out — note  that  they  can  cancel  each  other  out  even 
if  there  is  no  actual  interaction  between  the  various  mechanisms,  simply 
through  a  process  of  averaging. ^'^  This  seems  to  be  an  important  area  for 
future  research. 

The  other  pillar  of  this  class  of  theories  are  wealth  effects  on  invest- 
ment. The  evidence  on  wealth  effects  on  access  to  credit  has  already  been 
discussed  at  some  length  and  it  may  be  presumed  that  these  translate  into 
wealth  effects  on  investment.  However,  it  is  not  clear  that  the  causal  factor 
here  is  necessarily  wealth  rather  than  some  correlate  of  wealth.  More  direct 
evidence  is  now  available  from  studies  of  firms  in  the  U.S.,^**  showing  that 
firms  that  get  positive  cash-flow  shocks  invest  more,  even  after  controlUng 
for  changes  in  their  productive  opportunities.  In  terms  of  developing  coun- 
try data,  there  is  a  recent  study  by  Jeong  and  Townsend  (2000),  based  on 
data  from  Thailand,  which  shows  that  the  probability  of  owning  a  non-farm 
business  is  less  than  10%  in  the  bottom  decile  and  over  30%  in  the  top 
decile.^^  It  not  clear,  however,  that  the  entire  effect  they  find  is  attributable 
to  the  direct  effect  of  wealth  rather  than  the  effect  of  characteristics  that  are 
correlated  with  wealth.  Banerjee  and  Duflo  (2001)  use  a  change  in  directed 
lending  policies  in  India  as  a  natural  experiment  to  estimate  the  effect  of 
greater  access  to  working  capital  on  profits.  They  find  that  an  extra  one 
rupee  of  credit  increases  profits,  net  of  interest,  by  more  than  one  rupee. 
Duflo  (2000)  finds  evidence  of  strong  income  effects  on  investment  in  the 
health  of  young  girls  in  South  Africa,  though  it  is  not  clear  that  it  primarily 

this  kind  of  criticism. 

*' Strauss  and  Thomas  (1993)  list  more  than  twenty  studies  that  estimate  the  relation 
between  income  and  calorie  consumption  (as  a  meaure  of  nutrition).  Each  one  of  them 
reports  an  estimate  of  less  than  1  and  most  are  less  than  0.5.  Strauss  (1986)  estimates 
the  elasticity  of  wages  with  respect  to  nutrition  in  Sierra  Leone  and  reports  an  elasticity 
of  about  0.5. 

'"Srinivasan  (1994)  makes  a  similar  point. 
For  example,  a  series  of  non-convexities  at  slightly  different  places  may  average  to  a 
convex  map. 

^""See,  for  example,  Fazzari,  Hubbard  and  Petersen  (1988),  and  Fazzari  and  Petersen 
(1993). 

^^ Evans  and  Jovanovic  (1989),  show  a  similar  result  for  the  U.S. 


38 


reflects  the  effect  on  access  to  credit.  However  these  studies  are  relatively 
recent  and  only  time  can  tell  whether  their  results  will  turn  out  to  be  robust 
enough  to  be  the  foundations  of  a  theory  of  development.  As  of  now,  as  in 
many  other  instances,  theory  seems  to  be  ahead  of  the  evidence. 

Collective  poverty  traps,  unlike  poverty  traps  at  the  level  of  the  individ- 
ual, do  not  directly  rely  on  non-convexities.  Wealth  effects  on  investment 
are,  however,  clearly  crucial  as  are  strong  price  effects,  which  in  turn  rely 
on  demand  and  supply  for  factors  being  relatively  inelastic.  While  we  do 
have  some  estimates  of  these  elasticities  (though  mainly  from  developed 
countries),  the  theory  is  yet  to  be  developed  in  a  form  that  would  allow 
calibration  using  these  estimates. 

The  other  approach  to  evidence  would  be  to  look  directly  for  poverty 
traps.  In  other  words,  we  could  study  the  rates  of  economic  mobility  at 
different  levels  of  wealth.  Or  we  could  look  at  the  evolution  of  the  wealth 
distribution  and  try  to  estimate  from  it  the  implied  parameters  of  the  un- 
derlying economy  (including  mobility  rates).  Robert  Townsend,  in  joint 
work  with  a  number  of  his  co-authors,  has  done  interesting  work  along  these 
lines. ^^  The  problem  remains  however  that  we  have  no  independent  esti- 
mate of  the  part  of  mobility  that  arises  from  sources  that  are  excluded  from 
our  model,  such  as  genetics  or  learning. ^^  For  this  reason,  a  more  promising 
approach  may  be  to  look  at  the  mobility  patterns  within  a  single  industry, 
where  it  may  be  easier  to  calibrate  some  of  the  other  sources  of  mobility. 

There  have  been  also  been  attempts  to  use  cross-country  data  on  in- 
equality and  growth  to  look  at  this  question.^®  These  exercises  suffer  from 
two  problems.  First,  a  number  of  theories,  including  those  discussed  here, 
predict  a  relation  between  inequahty  and  growth,  and  the  data  does  not  dis- 
tinguish between  these  alternative  channels.  Second,  as  argued  by  Banerjee 
and  Duflo  (1999),  there  is  severe  danger  of  misspecification,  both  because  of 
possible  omitted  variables  and  strong  non-linearities  in  the  data. 

4.2     Beyond  Poverty  Traps 

Poverty  traps  are  only  the  starkest  form  of  what  makes  the  world  with  im- 
perfect credit  markets  interesting.  As  already  noted,  the  more  general  phe- 
nomenon of  slow  convergence  and  limited  social  mobility  is  both  interesting 
and  important  in  itself.  There  are  a  number  of  other  interesting  predictions 

^^See  Jeong  and  Townsend  (2000). 
There  are  attempts  to  measure  the  contribution  of  genetics  to  economic  mobility.  See 
Bowles  and  Gintis  (2001),  for  example. 

Sec  Benabou  (1996)  for  an  excellent  review  of  this  literature. 
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of  the  imperfect  credit  market  model,  some  of  which  we  now  briefly  sketch; 

First,  poor  capital  markets  will  tend  to  be  associated  with  great  diversity 
in  firm  size  within  the  same  industry.  In  the  case  where  there  is  an  optimal 
scale  of  production,  we  would  expect  to  see  firms  that  are  both  below  and 
above  this  scale — the  former  because  they  are  capital  starved  and  the  latter 
because  overall  underinvestment  generates  rents  in  the  industry  and  as  a 
result  those  who  do  have  the  capital  overinvest  in  order  to  capture  those 
rents.  This  appears  to  be  consistent  with  the  oft-remarked  fact  that  most 
industries  in  developing  countries  have  both  large  firms  and  a  fringe  of  very 
small  firms. 

Second,  one  might  expect  to  find  some  very  diversified  firms  in  economies 
with  poor  capital  markets.  This  because  there  will  be  rents  in  most  indus- 
tries and  those  who  have  money  to  invest  can  capture  these  rents.  Therefore, 
the  extent  of  diversification  may  not  be  guided  by  the  usual  considerations 
of  competence  and  synergies.  The  Trading  Companies  in  Japan,  the  Chae- 
bols in  Korea  and  the  Managing  Agencies  in  India  are  potential  examples 
of  this  phenomenon. 

Third,  industry  may  be  clustered  in  certain  locations  that  have  little 
to  recommend  them  as  venues  for  that  industry.  This  is  because  informal 
lenders  prefer  to  lend  to  borrowers  they  can  monitor  easily  and  this  might 
lead  them  to  prefer  those  who  invest  locally  and  in  familiar  industries.  In 
Banerjee  and  Munshi  (2001),  we  argue  that  the  concentration  of  the  knit- 
ted garment  industry  in  Tirupur,  an  otherwise  unknown  town  with  poor 
infrastructure,  is  partly  a  result  of  the  fact  that  the  local  population  of 
Gounders  need  an  outlet  for  their  agricultural  surplus. 

Finally,  tied  transactions  will  be  very  important  in  this  world,  because 
tying  saves  on  the  costs  of  monitoring.^^  Thus,  sellers  will  be  the  preferred 
source  of  credit  for  buyers  (trade  credit)  and  employers  will  be  the  preferred 
lender  for  workers.  Cunat  (2000)  justifies  the  persistence  of  expensive  trade 
credit  in  OECD  countries  (the  standard  rate  in  the  U.S.  is  44%)  in  these 
terms. 

5      Other  Contracting  Problems 

Contracting  problems  in  the  land  market  are,  not  surprisingly,  very  similar 
to  the  problems  in  credit  markets.  The  models  predict  that  wealthier  tenants 

^'This  argument  goes  back  to  Bardhan  (1983). 
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will  get  more  efficient  contracts  and  more  land.^'^ 

Contracting  problems  faced  by  lenders  are,  of  course,  just  the  other  side 
of  the  contracting  problems  faced  by  borrowers.  But  the  models  of  poverty 
traps  typically  focus  on  the  borrowers  even  though  it  is  easy  to  see  how 
a  very  similar  story  would  apply  to  lenders.  In  this  story,  the  poor  will 
not  be  able  to  lend  to  individual  borrowers  because  there  are  fixed  costs  of 
monitoring,  but  intermediaries  in  the  credit  market  also  do  not  want  to  deal 
with  them  because  there  is  a  fixed  cost  of  collecting  money  from  them  (they 
have  to  meet,  for  example).  Therefore,  they  earn  very  low  returns  on  their 
savings:  Rutherford  (1999)  documents  numerous  examples  where  the  poor 
accept  substantial  negative  rates  of  return  in  order  to  put  their  savings  in 
a  safe  place.  Given  the  low  return  on  savings,  they  prefer  not  to  save  and 
stay  poor. 

This  story  is  empirically  at  least  as  plausible  as  any  other  story  of  the 
poverty  trap,  especially  given  that  the  poor  are  more  likely  to  be  lenders 
than  borrowers.  Yet  there  are  few  models  of  this  type.  Matsuyama  (2000) 
is  one  paper  that  does  take  this  issue  seriously,  but  his  focus  is  on  collective 
rather  than  individual  level  poverty  traps. 

Contracting  problems  in  the  insurance  market,  by  contrast,  can  be  quite 
unlike  those  in  credit  markets.  Insurance  is  a  key  market  for  the  poor 
because  they  may  be  extremely  vulnerable  to  even  small  changes  in  their 
consumption  (that  is,  they  are  likely  to  be  more  risk  averse  than  the  rich).^^ 
This  is  consistent  with  the  large  and  growing  literature  which  shows  that 
the  poor  enter  into  many  sophisticated  arrangements  in  order  to  limit  their 
risk  exposure. ^^  As  noted  by  Newman  (1995),  this  is  not  inconsistent  with 
the  possibihty  that  the  poor  are  treated  better  than  the  rich  in  the  optimal 
insurance  contract. '^'^  The  reason  comes  from  precisely  the  fact  that  the 
poor  have  more  to  lose.  This  means  the  threat  of  even  a  small  loss  can  give 
good  incentives  to  the  poor,  making  it  easier  to  give  them  good  insurance 
as  well.  Newman  goes  on  to  show  that  under  reasonable  assumptions,  this 
effect  can  be  so  strong  that  the  poor  would  be  prepared  to  take  on  risky  and 
profitable  projects  that  the  rich  will  avoid.  This  is,  of  course,  quite  different 
from  the  predictions  of  the  credit  market  model — there,  it  is  the  rich  who 
do  the  profitable  projects.    It  is  clearly  a  force  towards  convergence,  and 

^°This  phenomenon  is  well-known  in  the  empirical  literature  on  tenancy  and  goes  under 
the  name  of  "Tenancy  Ladder" . 

*'See,  for  example,  Deaton  (1989). 

''^See,  for  example,  Udry  (1994)  and  Townsend  (1995). 

^'in  the  sense  that  the  rich  would  want  to  have  the  insurance  contract  that  the  poor 
get  in  equilibrium. 
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explains  why  there  is  no  poverty  trap  in  the  model  of  Banerjee  and  Newman 
(1991)  on  risk  bearing.  Of  course,  this  kind  of  result  depends  crucially  on  the 
setup.  We  would  get  the  opposite  result  if  the  insurance  market  was  entirely 
absent,  perhaps  because  the  fixed  costs  of  enforcing  such  a  contract  are  too 
high.^'*  Then  there  could  easily  be  a  poverty  trap — in  such  a  model  the 
poor  would  underinvest  because  investment  is  risky  and  they  are  unwilling 
to  bear  any  risk.^^ 

The  contrast  between  the  credit  market  case  and  the  insurance  case 
does  not  end  here.  Keeping  the  insurance  contract  fixed,  increasing  risk 
exposure  (by  weakening  social  protection,  for  example)  and  increased  risk 
aversion  hurt  the  poor  in  the  insurance  context  but  may  actually  help  them 
get  more  credit  (since  it  makes  it  easier  for  the  lender  to  "punish"  them  for 
defaulting). 

The  source  of  this  conflict  lies  in  the  basic  premises  of  these  two  narra- 
tives: in  the  insurance  market  view,  the  emphasis  is  on  the  vulnerability  of 
the  poor,  i.e.,  on  the  fact  that  they  cannot  afford  any  losses;  while  in  the 
credit  market  view,  the  poor  are  seen  as  unreliable  because  they  have  too 
little  to  lose  and  therefore  cannot  be  punished  for  defaulting.  These  views 
are  not  necessarily  inconsistent:  For  example,  because  it  may  be  very  costly 
for  the  lender  to  inflict  losses  on  the  poor,  even  though  losses  hmrt  the  poor 
a  lot.^^  Or  the  fact  that  the  poor  have  too  little  to  lose,  and  therefore  un- 
able to  invest,  might  make  the  sightly  less  poor  extremely  averse  to  the  risk 
of  becoming  poor  and  therefore  unwilling  to  invest.^'^  In  other  words,  it  is 
possible  that  the  credit  market  and  insurance  market  views  reinforce  each 
other.  But  it  remains  that  the  tension  between  them  is  real  and  far  from 
being  resolved. 

Contracting  in  product  markets  is  less  studied  in  the  context  of  LDCs. 
However,  as  the  share  of  quality-sensitive  products  in  world  demand  grows 
(as  it  has  in  recent  years  because  of  the  growth  in  the  new  economy),  sellers 
in  less  developed  countries  will  have  to  be  able  to  assure  buyers  that  they 
are  getting  the  desired  quality.  Since  quality  is  not  easy  to  contract  on 
ex  ante,  this  will  raise  the  importance  of  appropriate  contract  design  in 
product  markets  for  LDCs.  It  will  also  make  reputation  and  brand  names 
much  more  valuable,  making  it  harder  for  new  entrants.  One  case  where  this 

''^See  Kanbur  (1979)  and  Kihlstrom  and  Laffont  (1979)  for  results  of  this  type. 

^^For  a  simple  model  of  a  poverty  trap  of  this  type  see  Banerjee  (2000).  Mordudi  (1995) 
and  Walker  and  Ryan  (1990)  provide  some  suggestive  evidence  for  the  view  that  the  poor 
are  discouraged  by  risk  from  taking  up  the  most  profitable  opportunities. 
See  Banerjee  (2000)  for  a  more  elaborate  discussion  of  these  issues. 

"As  in  Banerjee  (2000). 
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has  already  happened  is  the  Indian  software  industry,  where  Banerjee  and 
Duflo  (2000)  show  that  the  more  reputed  firms  both  get  better  contracts 
and  more  rewarding  projects. 

6     The  View  of  Policy 

If  the  evidence  and  arguments  Hsted  above  do  one  thing,  it  is  to  challenge 
the  hegemony  of  the  complete  markets  Arrow-Debreu  model  as  the  basis 
for  policy  thinking.  The  usual  view  in  economics  seems  to  be  that  the 
complete  markets  model  provides  the  natural  framework,  modified  perhaps 
by  acknowledging  some  hmited  role  for  transactions  costs.  My  view  is  that 
there  are  many  important  contexts  where  these  transaction  costs  are  so  large 
and  the  consequent  deviation  from  the  complete  markets  model  so  glaring, 
that  it  is  better  to  abandon  the  complete  markets  model,  except  in  as  much 
as  it  provides  a  useful  intellectual  point  of  reference.  To  take  an  example, 
in  the  market  studied  by  Aleem,  the  transaction  cost  of  about  50  cents  on 
a  dollar  clearly  swamps  the  interest  rate  paid  to  the  ultimate  lender  (the 
depositor  in  bank,  who  gets  10  cents).  But  this  is  only  the  observable  part 
of  the  transaction  cost.  Then  there  is  the  cost  of  missed  opportunities, 
because  in  addition  to  these  high  rates  the  lenders  probably  impose  credit 
limits  and,  moreover,  some  people  are  completely  excluded  from  the  market 
because  no  one  knows  or  trusts  them.^^  Then  there  are  dynamic  costs:  The 
fact  that  the  current  borrower  underinvests  (or  earns  low  net  returns  on  his 
investment)  means  that  his  son/daughter  will  also  be  poor  and  will  also  not 
be  able  to  take  advantage  of  his/her  opportunities  and  talents.  Finally,  there 
are  general  equilibrium  eflFects:  Inefficiency  in  investment  means  low  wages 
and  slow  capital  accumulation  today,  both  of  which  contribute  to  poverty 
and  inefficiency  tomorrow. 

The  rejection  of  the  complete  markets  model  should  not  be  seen  as  a 
justification  for  old-fashioned  dirigiste  pohces.  There  are,  of  course,  good 
reasons  to  worry  about  the  deliberate  misuse  of  these  policies.  But  per- 
haps equally  importantly,  the  recognition  that  markets  often  fail  does  not 
automatically  imply  that  we  should  pursue  anti-market  policies.  To  take 
an  example,  trade  protection  may  be  particularly  bad  if  capital  markets  are 
imperfect  because  it  reinforces  the  capital  market  frictions  that  slow  down 
the  flow  of  capital  toward  its  best  possible  uses. 

'"'There  is,  obviously  a  trade-off  here.  Some  lenders  may  limit  themselves  to  borrowers 
they  know  very  well,  which  brings  down  the  direct  transaction  cost  but  increases  the  costs 
coming  from  exclusion  and  missed  opportunities. 
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What  does  emerge  from  the  analysis  above  is  the  need  to  build  policies 
that  recognize  the  market  failures  that  are  most  important  in  the  particular 
context.  Thus,  assessing  the  growth  impact  of  trade  policies  without  taking 
account  of  the  distributional  impact  of  these  policies  is  self-defeating,  since 
the  distributional  impact  will  frame  the  future  pattern  of  investment. 

The  imperative  of  taking  specific  market  failures  seriously  when  making 
policy  is  obviously  rather  vague.  To  give  it  some  more  content,  I  will  now 
discuss  some  simple  examples  of  how  thinking  about  the  world  in  this  way 
feeds  into  specific  pohcy  recommendations. 

The  first  example,  already  alluded  to,  comes  from  credit  and  insurance 
markets.  As  we  saw,  the  imphcations  of  better  social  protection  tend  to  be 
very  different  in  models  of  credit  and  insurance.  This  needs  to  be  taken  into 
account  in  designing  social  protection  mechanisms. 

Second,  it  is  clearly  important  to  try  to  reduce  the  cost  of  credit  to 
the  poor.  One  idea  that  has  received  a  lot  of  currency  is  to  make  use  of 
peer  monitoring  and  screening  by  peers  through  micro-finance  institutions.^^ 
This  has  two  related  advantages.  First,  members  of  one's  peer  group  may 
be  better  at  monitoring  and  screening — this  reduces  the  cost  of  monitoring. 
Second,  the  usual  arrangements  involve  mutual  monitoring  on  a  quid  pro 
quo  basis.  Therefore,  the  interest  rate  does  not  have  to  be  raised  in  order 
to  pay  for  the  cost  of  monitoring.  Lower  interest  rates,  as  we  already  saw, 
generate  better  incentives  and  therefore  less  monitoring  is  needed.'*'  Other 
possible  interventions  include  trying  to  develop  systems  of  credit  rating  and 
centrahzing  credit  histories  so  that  the  credit  markets  become  less  segmented 
and  borrowers  have  access  to  the  cheapest  sources  of  credit.  Developing 
better  systems  for  recording  of  property  ownership  (so  that  the  assets  that 
the  poor  have  can  be  used  as  collateral)  and  a  court  system  that  resolves 
property  disputes  quickly  and  effectively  (so  that  lenders  believe  that  they 
can  collect  on  the  collateral),  will  also  help  the  poor.  Helping  the  poor  to 
develop  credit  histories  and  helping  them  to  learn  to  deal  with  the  financial 
system  more  generally  (by  keeping  proper  accounts,  for  example)  is  yet 
another  potentially  fruitful  avenue. ^^ 

Third,  giving  a  central  role  to  issues  of  credit  access  gives  a  new  urgency 

See  Banerjee,  Besley,  Guinanne  (1994)  for  a  model  of  lending  beised  on  peer  monitoring 
and  Ghatak  (1998)  for  a  model  of  screening  by  peers. 

This  argument  relies  on  the  assumption  that  the  time  spent  on  monitoring  has  no 
alternative  cash-generating  use. 

'^^This  is  one  way  to  interpret  the  contribution  of  many  of  the  NGOs  that  work  in 
micro-credit  but  focus  on  lending  to  individuals  rather  than  groups.  BRI  in  Indonesia  is 
a  well-known  example. 
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to  the  old  policy  prescription  of  encouraging  savings.  It  is  easy  to  see  that 
one  way  to  get  out  poverty  traps  is  to  raise  the  savings  rate.  In  Figures  1, 
2,  3  and  4,  this  pushes  up  the  map  from  current  wealth  to  future  wealth 
and  thereby  makes  poverty  traps  (both  individual  and  collective)  less  likely. 
More  generally,  as  already  discussed,  the  poor  may  save  too  little  (relative  to 
the  social  optimum)  when  capital  markets  are  imperfect,  both  because  the 
rates  paid  to  depositors  tend  to  be  too  low  (because  lending  is  difficult)  and 
because  they  do  not  have  the  wherewithal  to  become  investors  and  therefore 
put  their  savings  to  the  best  possible  use.  This  is  compounded  by  the  fact 
that  intermediaries  in  the  credit  market  may  bypass  them  for  the  simple  rea- 
son that  given  the  small  volume  of  their  savings,  the  fixed  cost  of  collecting 
savings  may  swamp  any  potential  returns  from  investing  them.  Subsidiz- 
ing access  to  savings  opportunities  may  therefore  be  a  powerful  weapon  for 
helping  the  poor. 

Fourth,  thinking  of  the  underlying  contract  theory  clearly  gives  us  a  very 
different  perspective  on  land  reform  and  tenancy  reform.  In  particular,  it 
tells  us  that  it  may  be  possible  to  achieve  many  of  the  desirable  productivity 
effects  of  such  reforms  without  actually  changing  any  land  rights.  One  ex- 
ample of  such  an  intervention  would  be  a  program  that  improves  the  outside 
options  of  the  tenants,  such  as  an  employment  guarantee  scheme.'"  As  the 
tenant's  outside  option  improves,  he  will  be  rewarded  more  and  that  can 
lead  to  an  improvement  in  his  incentives. '^^ 

Finally,  if  our  prediction  about  the  increasing  importance  of  product 
market  contracting  in  developing  countries  is  borne  out,  some  of  these  coun- 
tries will  have  to  make  significant  policy  changes  in  order  to  not  be  left 
out.  In  part,  this  will  involve  strengthening  the  court  system,  but  in  part 
also  it  will  require  other  innovations  such  as  helping  domestic  companies 
build  a  reputation  or  making  it  possible  for  domestic  producers  to  enter  into 
strategic  partnerships  with  reputable  MNCs  in  order  to  benefit  from  their 
reputation.'^' 


^■^The  strategy  of  improving  outside  options  is  called  an  empowerment  strategy  in  Baner- 
jee,  Gertler  and  Ghatak  (2000),  who  also  provide  a  formal  analysis  of  how  these  strategies 
work. 

For  a  more  detailed  analysis  of  land  reforms  and  alternatives  to  land  reform,  see 
Banerjee  (1999). 

'  In  many  cases  this  may  not  involve  much  more  than  allowing  foreign  equity  partici- 
pation in  domestic  companies.  However,  in  cases  where  countries  suffer  from  a  collective 
reputation  problem,  there  may  be  a  case  for  a  more  substantial  intervention:  penalizing 
domestic  companies  that  fail  to  meet  some  quality  standard  may  be  one  way  of  achieving 
this  (see  Tirole,  1996,  for  a  model  of  collective  reputation). 
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7     Conclusion 

I  have  made  the  choice  in  this  survey  of  focusing  on  one  rather  specific  topic 
in  order  to  explain  better  the  logic  of  how  research  in  this  area  of  develop- 
ment economics  has  evolved  over  the  last  twenty  years,  and  to  pinpoint  key 
gaps  in  our  knowledge  in  this  relatively  well-studied  area.  To  conclude,  let 
me  now  say  something  about  the  broader  agenda. 

The  one  most  important  limitation  of  my  survey  is  its  focus  on  the 
level,  distribution  and  growth  of  output  as  the  main  outcome  of  interest. 
Development  economics  is  much  broader:  there  is  a  large  and  important 
literature  which  attempts  to  explain  the  existence  and  persistence  of  institu- 
tions such  as  sharecropping,  the  village  money-lender,  ROSCAs,  community- 
based  lending  networks,  cooperatives,  etc.,  based  on  contract  theoretic  ar- 
guments.^^ There  is  also  a  long  tradition  which  argues  that  institutions  act 
as  an  independent  force  in  the  economy  and  influence  economic  outcomes. 
The  Lewis  model,  which  argued  that  the  particular  structure  of  the  implicit 
contract  in  family  farms  restrains  migration  and  growth,  is  perhaps  the  most 
well-known  example  of  a  model  of  this  type.  While  there  was  a  period  when 
economists  seemed  to  take  the  extreme  Coasian  view  that  inefficient  insti- 
tutions should  not  exist,  there  is  now  a  clearer  understanding  that  there 
is  no  good  reason  why  institutions  should  be  set  up  with  an  eye  towards 
global  optimality.  Banerjee  and  Newman  (1998)  present  an  example  where 
a  locally  efficient  institution  leads  to  inefficient  global  outcomes. '^^ 

What  are  still  rare  are  dynamic  models  where  the  institutions  themselves 
evolve.  Banerjee  and  Newman  (1993)  present  one  such  model,  aimed  at 
explaining  the  evolution  of  the  economic  institutions  of  capitalism  (large 
capitalist  firms  as  against  self-employment)  as  a  result  of  change  in  the 
wealth  distribution.'^'^  Greenwood  and  Jovanovic  (1990)  present  a  model 
where  the  financial  sector  evolves  with  growth. 

Even  rarer  are  models  where  institutions  not  only  evolve  but  actually 
have  feedback  effects  on  the  rest  of  the  economy.  Some  recent  examples 
include  Acemoglu  and  Robinson  (1998),  which  looks  at  the  evolution  of 
the  franchise,  Acemoglu  and  Zilibotti  (1997),  which  focuses  on  the  financial 
sector,  and  Banerjee  and  Newman  (1998),  which  looks  at  the  evolution  of  the 

'^^Bardhan  (1989)  and  Greif  (1997)  are  two  important  sources  for  this  literature.  Legros 
and  Newman  (1996),  Wells  (1999),  and  Prescott  and  Townsend  (2000)  are  interesting 
recent  examples  of  this  style  of  work. 

Douglas  North  has  been  the  most  influential  exponent  of  this  general  position  in  recent 
years.  See  Grief  (1994)  for  an  interesting  and  nuanced  statement  of  this  view. 

^^See  also  the  related  work  by  Lloyd-Ellis  and  Bernhardt  (2000). 
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modern  sector.  But  there  are  not  many  others,  even  though  it  seems  clear 
that  this  is  the  process  that  development  economists  would  like  to  capture. 
A  lot  has  been  done  in  the  last  twenty  years.  If  that  has  achieved  any- 
thing, it  is  to  make  us  aware  of  where  we  would  like  to  be  and  what  needs 
to  be  done.  Now,  I  hope,  it  is  only  a  matter  of  time. 
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